Table of Contents Author Guidelines Submit a Manuscript
Applied Computational Intelligence and Soft Computing
Volume 2016, Article ID 1465810, 13 pages
http://dx.doi.org/10.1155/2016/1465810
Research Article

Online Incremental Learning for High Bandwidth Network Traffic Classification

Faculty of Electrical Engineering, Universiti Teknologi Malaysia, 81310 Johor Bahru, Johor, Malaysia

Received 31 October 2015; Revised 27 January 2016; Accepted 31 January 2016

Academic Editor: Jun He

Copyright © 2016 H. R. Loo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. H. Abdulsalam, Streaming random forest [Ph.D. Dissertation], School of Computing, Queen's University, Kingston, Canada, 2008.
  2. C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu, “On demand classification of data streams,” in Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '04), pp. 503–508, ACM, Seattle, Wash, USA, August 2004. View at Scopus
  3. P. Domingos and G. Hulten, “Mining high-speed data streams,” in Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '00), pp. 71–80, ACM, New York, NY, USA, 2000. View at Publisher · View at Google Scholar
  4. G. Hulten, L. Spencer, and P. Domingos, “Mining time-changing data streams,” in Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '01), pp. 97–106, ACM, San Francisco, Calif, USA, August 2001. View at Publisher · View at Google Scholar
  5. W. N. Street and Y. Kim, “A streaming ensemble algorithm (SEA) for large-scale classification,” in Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '01), pp. 377–382, ACM, San Francisco, Calif, USA, August 2001. View at Scopus
  6. H. Wang, W. Fan, P. S. Yu, and J. Han, “Mining concept-drifting data streams using ensemble classifiers,” in Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '03), pp. 226–235, ACM, Washington, DC, USA, August 2003. View at Publisher · View at Google Scholar · View at Scopus
  7. X. Tian, Q. Sun, X. Huang, and Y. Ma, “Dynamic online traffic classification using data stream mining,” in Proceedings of the International Conference on MultiMedia and Information Technology (MMIT '08), pp. 104–107, Zigui, China, December 2008. View at Publisher · View at Google Scholar
  8. G. Mingliang, H. Xiaohong, T. Xu, M. Yan, and W. Zhenhua, “Data stream mining based real-time highspeed traffic classification,” in Proceedings of the 2nd IEEE International Conference on Broadband Network & Multimedia Technology (IEEE IC-BNMT '09), pp. 700–705, Beijing, China, October 2009. View at Publisher · View at Google Scholar · View at Scopus
  9. B. Raahemi, W. Zhong, and J. Liu, “Peer-to-peer traffic identification by mining IP layer data streams using concept-adapting very fast decision tree,” in Proceedings of the 20th IEEE International Conference on Tools with Artificial Intelligence (ICTAI '08), pp. 525–532, IEEE, Dayton, Ohio, USA, November 2008. View at Publisher · View at Google Scholar · View at Scopus
  10. S. Li, Towards ultra high-speed online network traffic classifcation enhanced with machine learning algorithms and openflow accelerators [Ph.D. thesis], University of Massachusetts Lowell, 2012.
  11. M. Zhang, H. Zhang, B. Zhang, and G. Lu, “Encrypted traffic classification based on an improved clustering algorithm,” in Trustworthy Computing and Services, vol. 320 of Communications in Computer and Information Science, pp. 124–131, Springer, Berlin, Germany, 2013. View at Publisher · View at Google Scholar
  12. M. M. Masud, C. Woolam, J. Gao et al., “Facing the reality of data stream classification: coping with scarcity of labeled data,” Knowledge and Information Systems, vol. 33, no. 1, pp. 213–244, 2012. View at Publisher · View at Google Scholar · View at Scopus
  13. M. Estlick, M. Leeser, J. Theiler, and J. J. Szymanski, “Algorithmic transformations in the implementation of K-means clustering on reconfigurable hardware,” in Proceedings of the 9th ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA '01), pp. 103–110, ACM, Monterey, Calif, USA, February 2001. View at Publisher · View at Google Scholar
  14. A. Moore, D. Zuev, and M. Crogan, “Discriminators for use in flow-based classification,” Tech. Rep., Department of Computer Science, Queen Mary University of London, London, UK, 2005. View at Google Scholar
  15. F. Gringoli, L. Salgarelli, M. Dusi, N. Cascarano, F. Risso, and K. C. Claffy, “GT: picking up the truth from the ground for internet traffic,” ACM SIGCOMM Computer Communication Review, vol. 39, no. 5, pp. 12–18, 2009. View at Publisher · View at Google Scholar
  16. H. R. Loo and M. N. Marsono, “Online data stream classification with incremental semi-supervised learning,” in Proceedings of the 2nd IKDD Conference on Data Science (CODS '15), pp. 132–133, ACM, Bangalore, India, March 2015. View at Publisher · View at Google Scholar
  17. G. Münz, S. Li, and G. Carle, “Traffic anomaly detection using k-means clustering,” in Proceedings of the GI/ITG Workshop MMBnet, September 2007.
  18. J. Erman, M. Arlitt, and A. Mahanti, “Traffic classification using clustering algorithms,” in Proceedings of the SIGCOMM Workshop on Mining Network Data (MineNet '06), pp. 281–286, ACM, Pisa, Italy, September 2006. View at Publisher · View at Google Scholar
  19. G. Maiolini, G. Molina, A. Baiocchi, and A. Rizzi, “On the fly application flows identification by exploiting K-means based classifiers,” Journal of Information Assurance and Security, vol. 4, no. 2, pp. 142–150, 2009. View at Google Scholar
  20. L. Yingqiu, L. Wei, and L. Yunchun, “Network traffic classification using K-means clustering,” in Proceedings of the 2nd International Multi-Symposiums on Computer and Computational Sciences (IMSCCS '07), pp. 360–365, IEEE, Iowa City, Iowa, USA, August 2007. View at Publisher · View at Google Scholar · View at Scopus
  21. C. Yang and B. Huang, “Traffic classification using an improved clustering algorithm,” in Proceedings of the International Conference on Communications, Circuits and Systems (ICCCAS '08), pp. 515–518, IEEE, Fujian, China, May 2008. View at Publisher · View at Google Scholar · View at Scopus
  22. T. Zhang, R. Ramakrishnan, and M. Livny, “BIRCH: an efficient data clustering method for very large databases,” ACM SIGMOD Record, vol. 25, no. 2, pp. 103–114, 1996. View at Publisher · View at Google Scholar · View at Scopus
  23. C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu, “A framework for clustering evolving data streams,” in Proceedings of the 29th International Conference on Very Large Data Bases (VLDB '03), vol. 29, pp. 81–92, VLDB Endowment, 2003.
  24. Z. Li, J.-G. Lee, X. Li, and J. Han, “Incremental clustering for trajectories,” in Database Systems for Advanced Applications, vol. 5982 of Lecture Notes in Computer Science, pp. 32–46, Springer, Berlin, Germany, 2010. View at Publisher · View at Google Scholar
  25. S. Lühr and M. Lazarescu, “Incremental clustering of dynamic data streams using connectivity based representative points,” Data & Knowledge Engineering, vol. 68, no. 1, pp. 1–27, 2009. View at Publisher · View at Google Scholar · View at Scopus
  26. M. Ester, H.-P. Kriegel, J. Sander, M. Wimmer, and X. Xu, “Incremental clustering for mining in a data warehousing environment,” in Proceedings of the 24rd International Conference on Very Large Data Bases (VLDB '98), pp. 323–333, August 1998.
  27. K. Burbeck and S. Nadjm-Tehrani, “Adaptive real-time anomaly detection with incremental clustering,” Information Security Technical Report, vol. 12, no. 1, pp. 56–67, 2007. View at Publisher · View at Google Scholar · View at Scopus
  28. W. Jiang and M. Gokhale, “Real-time classification of multimedia traffic using FPGA,” in Proceedings of the International Conference on Field Programmable Logic and Applications (FPL '10), pp. 56–63, IEEE, Milan, Italy, September 2010. View at Publisher · View at Google Scholar · View at Scopus
  29. F. Winterstein, S. Bayliss, and G. A. Constantinides, “FPGA-based K-means clustering using tree-based data structures,” in Proceedings of the 23rd International Conference on Field Programmable Logic and Applications (FPL '13), pp. 1–6, IEEE, Porto, Portugal, September 2013. View at Publisher · View at Google Scholar · View at Scopus
  30. T. Maruyama, “Real-time K-means clustering for color images on reconfigurable hardware,” in Proceedings of the 18th International Conference on Pattern Recognition (ICPR '06), vol. 2, pp. 816–819, IEEE, Hong Kong, August 2006. View at Publisher · View at Google Scholar · View at Scopus
  31. A. G. S. Filho, A. C. Frery, C. C. de Araújo et al., “Hyperspectral images clustering on reconfigurable hardware using the k-means algorithm,” in Proceedings of the 16th Symposium on Integrated Circuits and Systems Design (SBCCI '03), pp. 99–104, IEEE, São Paulo, Brazil, September 2003. View at Publisher · View at Google Scholar
  32. J. Singaraju and J. A. Chandy, “Active storage networks for accelerating K-means data clustering,” in Reconfigurable Computing: Architectures, Tools and Applications, vol. 6578 of Lecture Notes in Computer Science, pp. 102–109, Springer, Berlin, Germany, 2011. View at Publisher · View at Google Scholar
  33. Z. Lin, C. Lo, and P. Chow, “K-means implementation on FPGA for high-dimensional data using triangle inequality,” in Proceedings of the 22nd International Conference on Field Programmable Logic and Applications (FPL '12), pp. 437–442, Oslo, Norway, August 2012. View at Publisher · View at Google Scholar · View at Scopus
  34. D. Lavenier, “Fpga implementation of the k-means clustering algorithm for hyperspectral images,” in Los Alamos National Laboratory, Citeseer, Laur, Philippines, 2000. View at Google Scholar
  35. V. Bhaskaran, Parametrized implementation of K-means clustering on reconfigurable systems [Ph.D. Dissertation], University of Tennessee, Knoxville, Tenn, USA, 2003.
  36. J. S. S. Kutty, F. Boussaid, and A. Amira, “A high speed configurable FPGA architecture for k-mean clustering,” in Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '13), pp. 1801–1804, IEEE, Beijing, China, May 2013. View at Publisher · View at Google Scholar · View at Scopus
  37. H. R. Loo, T. Andromeda, and M. N. Marsono, “Online data stream learning and classification with limited labels,” Proceeding of the Electrical Engineering Computer Science and Informatics, vol. 1, no. 1, pp. 161–164, 2014. View at Google Scholar
  38. A. Monemi, R. Zarei, and M. N. Marsono, “Online NetFPGA decision tree statistical traffic classifier,” Computer Communications, vol. 36, no. 12, pp. 1329–1340, 2013. View at Publisher · View at Google Scholar · View at Scopus
  39. A. Bifet, G. Holmes, R. Kirkby, and B. Pfahringer, “MOA: massive online analysis,” The Journal of Machine Learning Research, vol. 11, pp. 1601–1604, 2010. View at Google Scholar · View at Scopus
  40. J. Gama, P. Medas, G. Castillo, and P. Rodrigues, “Learning with drift detection,” in Advances in Artificial Intelligence-SBIA 2004: 17th Brazilian Symposium on Artificial Intelligence, Sao Luis, Maranhao, Brazil, September 29-Ocotber 1, 2004. Proceedings, vol. 3171 of Lecture Notes in Computer Science, pp. 286–295, Springer, Berlin, Germany, 2004. View at Publisher · View at Google Scholar
  41. Waikato, MOA Massive Online Analysis, 2015, http://moa.cs.waikato.ac.nz/.