Shengyong Ding, Shiwu Min, Yongbing Fan. A Large Scale NetFlow Analysis System Based on Spark[J]. Telecommunications science, 2014, 30(10): 48-51. DOI: 10.3969/j.issn.1000-0801.2014.10.009.
The existing systems usually adopt private distributed architectures
which face scalability
openness
cost and latency problems. The development of big data technology such as Spark offers new opportunity for large scale NetFlow processing systems. A new analysis system based on Spark platform was proposed and the effectiveness of the method was verified. The experimental results show its superior performance.
关键词
Keywords
references
White T . Hadoop: the Definitive Guide . O'Reilly Media Inc , 2012
Zaharia M , Chowdhury M , Das T , et al . Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing . Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation , San Jose, CA, USA , 2012
Rossi D , Silvio V . Fine-grained traffic classification with NetFlow data . Proceedings of the 6th International Wireless Communications and Mobile Computing Conference , Shenzhen, China , 2010