Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2015, Article ID 939175, 17 pages
http://dx.doi.org/10.1155/2015/939175
Research Article

Data Stream Classification Based on the Gamma Classifier

1Neural Networks and Unconventional Computing Lab/Alpha-Beta Group, Centro de Investigación en Computación, Instituto Politécnico Nacional, Avenida Juan de Dios Bátiz, Colonia Nuevo Industrial Vallejo, Delegación Gustavo A. Madero, 07738 Mexico City, DF, Mexico
2Intelligent Computing Lab/Alpha-Beta Group, Centro de Innovación y Desarrollo Tecnológico en Cómputo, Instituto Politécnico Nacional, Avenida Juan de Dios Bátiz, Colonia Nuevo Industrial Vallejo, Delegación Gustavo A. Madero, 07700 Mexico City, DF, Mexico
3LIAAD-INESC INESC TEC and Faculty of Economics, University of Porto, Rua Dr. Roberto Frias 378, 4200-378 Porto, Portugal

Received 13 April 2015; Revised 7 July 2015; Accepted 29 July 2015

Academic Editor: Konstantinos Karamanos

Copyright © 2015 Abril Valeria Uriarte-Arcia et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. V. Turner, J. F. Gantz, D. Reinsel, and S. Minton, “The digital universe of opportunities: rich data and the increasing value of the internet of things,” Tech. Rep., IDC Information and Data, 2014, http://www.emc.com/leadership/digital-universe/2014iview/index.htm. View at Google Scholar
  2. A. Bifet, Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams, IOS Press, Amsterdam, The Netherlands, 2010.
  3. J. Gama, Knowledge Discovery from Data Streams, Chapman & Hall/CRC, 2010. View at MathSciNet
  4. J. Beringer and E. Hüllermeier, “Efficient instance-based learning on data streams,” Intelligent Data Analysis, vol. 11, no. 6, pp. 627–650, 2007. View at Google Scholar · View at Scopus
  5. J. Gama, I. Žliobaitė, A. Bifet, M. Pechenizkiy, and A. Bouchachia, “A survey on concept drift adaptation,” ACM Computing Surveys, vol. 46, no. 4, article 44, 2014. View at Publisher · View at Google Scholar · View at Scopus
  6. P. Domingos and G. Hulten, “Mining high-speed data streams,” in Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 71–80, Boston, Mass, USA, August 2000. View at Scopus
  7. G. Hulten, L. Spencer, and P. Domingos, “Mining time-changing data streams,” in Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '01), pp. 97–106, San Francisco, Calif, USA, August 2001. View at Scopus
  8. C. Liang, Y. Zhang, P. Shi, and Z. Hu, “Learning very fast decision tree from uncertain data streams with positive and unlabeled samples,” Information Sciences, vol. 213, pp. 50–67, 2012. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  9. L. Rutkowski, M. Jaworski, L. Pietruczuk, and P. Duda, “Decision trees for mining data streams based on the gaussian approximation,” IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 1, pp. 108–119, 2014. View at Publisher · View at Google Scholar · View at Scopus
  10. G. Widmer and M. Kubat, “Learning in the presence of concept drift and hidden contexts,” Machine Learning, vol. 23, no. 1, pp. 69–101, 1996. View at Google Scholar · View at Scopus
  11. F. Ferrer-Troyano, J. S. Aguilar-Ruiz, and J. C. Riquelme, “Data streams classification by incremental rule learning with parameterized generalization,” in Proceedings of the 2006 ACM Symposium on Applied Computing, pp. 657–661, Dijon, France, April 2006.
  12. D. Mena-Torres and J. S. Aguilar-Ruiz, “A similarity-based approach for data stream classification,” Expert Systems with Applications, vol. 41, no. 9, pp. 4224–4234, 2014. View at Publisher · View at Google Scholar · View at Scopus
  13. A. Shaker and E. Hüllermeier, “IBLStreams: a system for instance-based classification and regression on data streams,” Evolving Systems, vol. 3, no. 4, pp. 235–249, 2012. View at Publisher · View at Google Scholar · View at Scopus
  14. A. Ghazikhani, R. Monsefi, and H. S. Yazdi, “Recursive least square perceptron model for non-stationary and imbalanced data stream classification,” Evolving Systems, vol. 4, no. 2, pp. 119–131, 2013. View at Publisher · View at Google Scholar · View at Scopus
  15. A. Ghazikhani, R. Monsefi, and H. S. Yazdi, “Ensemble of online neural networks for non-stationary and imbalanced data streams,” Neurocomputing, vol. 122, pp. 535–544, 2013. View at Publisher · View at Google Scholar · View at Scopus
  16. A. Sancho-Asensio, A. Orriols-Puig, and E. Golobardes, “Robust on-line neural learning classifier system for data stream classification tasks,” Soft Computing, vol. 18, no. 8, pp. 1441–1461, 2014. View at Publisher · View at Google Scholar · View at Scopus
  17. D. M. Farid, L. Zhang, A. Hossain et al., “An adaptive ensemble classifier for mining concept drifting data streams,” Expert Systems with Applications, vol. 40, no. 15, pp. 5895–5906, 2013. View at Publisher · View at Google Scholar · View at Scopus
  18. D. Brzezinski and J. Stefanowski, “Combining block-based and online methods in learning ensembles from concept drifting data streams,” Information Sciences, vol. 265, pp. 50–67, 2014. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  19. G. Ditzler and R. Polikar, “Incremental learning of concept drift from streaming imbalanced data,” IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 10, pp. 2283–2301, 2013. View at Publisher · View at Google Scholar · View at Scopus
  20. M. J. Hosseini, Z. Ahmadi, and H. Beigy, “Using a classifier pool in accuracy based tracking of recurring concepts in data stream classification,” Evolving Systems, vol. 4, no. 1, pp. 43–60, 2013. View at Publisher · View at Google Scholar · View at Scopus
  21. N. Saunier and S. Midenet, “Creating ensemble classifiers through order and incremental data selection in a stream,” Pattern Analysis and Applications, vol. 16, no. 3, pp. 333–347, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  22. M. M. Masud, J. Gao, L. Khan, J. Han, and B. M. Thuraisingham, “Classification and novel class detection in concept-drifting data streams under time constraints,” IEEE Transactions on Knowledge and Data Engineering, vol. 23, no. 6, pp. 859–874, 2011. View at Publisher · View at Google Scholar · View at Scopus
  23. A. Alzghoul, M. Löfstrand, and B. Backe, “Data stream forecasting for system fault prediction,” Computers and Industrial Engineering, vol. 62, no. 4, pp. 972–978, 2012. View at Publisher · View at Google Scholar · View at Scopus
  24. J. Kranjc, J. Smailović, V. Podpečan, M. Grčar, M. Žnidaršič, and N. Lavrač, “Active learning for sentiment analysis on data streams: methodology and workflow implementation in the ClowdFlows platform,” Information Processing and Management, vol. 51, no. 2, pp. 187–203, 2015. View at Publisher · View at Google Scholar · View at Scopus
  25. J. Smailović, M. Grčar, N. Lavrač, and M. Žnidaršič, “Stream-based active learning for sentiment analysis in the financial domain,” Information Sciences, vol. 285, pp. 181–203, 2014. View at Publisher · View at Google Scholar · View at Scopus
  26. S. Shalev-Shwartz, Y. Singer, N. Srebro, and A. Cotter, “Pegasos: primal estimated sub-gradient solver for SVM,” in Proceedings of the 24th International Conference on Machine Learning, pp. 807–814, June 2007. View at Publisher · View at Google Scholar · View at Scopus
  27. Z. Miller, B. Dickinson, W. Deitrick, W. Hu, and A. H. Wang, “Twitter spammer detection using data stream clustering,” Information Sciences, vol. 260, pp. 64–73, 2014. View at Publisher · View at Google Scholar · View at Scopus
  28. B. Krawczyk and M. Woźniak, “Incremental weighted one-class classifier for mining stationary data streams,” Journal of Computational Science, vol. 9, pp. 19–25, 2015. View at Publisher · View at Google Scholar
  29. J. Gama, “A survey on learning from data streams: current and future trends,” Progress in Artificial Intelligence, vol. 1, no. 1, pp. 45–55, 2012. View at Publisher · View at Google Scholar
  30. L. I. Kuncheva, “Change detection in streaming multivariate data using likelihood detectors,” IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 5, pp. 1175–1180, 2013. View at Publisher · View at Google Scholar · View at Scopus
  31. G. J. Ross, N. M. Adams, D. K. Tasoulis, and D. J. Hand, “Exponentially weighted moving average charts for detecting concept drift,” Pattern Recognition Letters, vol. 33, no. 2, pp. 191–198, 2012. View at Publisher · View at Google Scholar · View at Scopus
  32. J. Gama, P. Medas, G. Castillo, and P. Rodrigues, “Learning with drift detection,” in Advances in Artificial Intelligence—SBIA 2004, vol. 3171 of Lecture Notes in Computer Science, pp. 286–295, Springer, Berlin, Germany, 2004. View at Publisher · View at Google Scholar
  33. X. Wu, P. Li, and X. Hu, “Learning from concept drifting data streams with unlabeled data,” Neurocomputing, vol. 92, pp. 145–155, 2012. View at Publisher · View at Google Scholar · View at Scopus
  34. X. Qin, Y. Zhang, C. Li, and X. Li, “Learning from data streams with only positive and unlabeled data,” Journal of Intelligent Information Systems, vol. 40, no. 3, pp. 405–430, 2013. View at Publisher · View at Google Scholar · View at Scopus
  35. M. M. Masud, C. Woolam, J. Gao et al., “Facing the reality of data stream classification: coping with scarcity of labeled data,” Knowledge and Information Systems, vol. 33, no. 1, pp. 213–244, 2012. View at Publisher · View at Google Scholar · View at Scopus
  36. J. Read, A. Bifet, G. Holmes, and B. Pfahringer, “Scalable and efficient multi-label classification for evolving data streams,” Machine Learning, vol. 88, no. 1-2, pp. 243–272, 2012. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  37. I. López-Yáñez, A. J. Argüelles-Cruz, O. Camacho-Nieto, and C. Yáñez-Márquez, “Pollutants time-series prediction using the gamma classifier,” International Journal of Computational Intelligence Systems, vol. 4, no. 4, pp. 680–711, 2011. View at Google Scholar · View at Scopus
  38. M. E. Acevedo-Mosqueda, C. Yáñez-Márquez, and I. López-Yáñez, “Alpha-beta bidirectional associative memories: theory and applications,” Neural Processing Letters, vol. 26, no. 1, pp. 1–40, 2007. View at Publisher · View at Google Scholar · View at Scopus
  39. C. Yáñez-Márquez, E. M. Felipe-Riverón, I. López-Yáñez, and R. Flores-Carapia, “A novel approach to automatic color matching,” in Progress in Pattern Recognition, Image Analysis and Applications, vol. 4225 of Lecture Notes in Computer Science, pp. 529–538, Springer, Berlin, Germany, 2006. View at Publisher · View at Google Scholar
  40. I. López-Yáñez, L. Sheremetov, and C. Yáñez-Márquez, “A novel associative model for time series data mining,” Pattern Recognition Letters, vol. 41, no. 1, pp. 23–33, 2014. View at Publisher · View at Google Scholar · View at Scopus
  41. A. Bifet, G. Holmes, R. Kirkby, and B. Pfahringer, “Data stream mining: a practical approach,” Tech. Rep., University of Waikato, 2011. View at Google Scholar
  42. Massive Online Analysis (MOA), 2014, http://moa.cms.waikato.ac.nz/datasets/.
  43. K. Bache and M. Lichman, UCI Machine Learning Repository, School of Information and Computer Science, University of California, Irvine, Calif, USA, 2013, http://archive.ics.uci.edu/ml.
  44. A. Bifet, R. Kirkby, P. Kranen, and P. Reutemann, “Massive Online Analysis Manual,” 2012, http://moa.cms.waikato.ac.nz/documentation/.
  45. M. Harries, “Splice-2 comparative evaluation: electricity pricing,” Tech. Rep., University of South Wales, Cardiff, UK, 1999. View at Google Scholar
  46. R. Agrawal, T. Imielinski, and A. Swami, “Database mining: a performance perspective,” IEEE Transactions on Knowledge and Data Engineering, vol. 5, no. 6, pp. 914–925, 1993. View at Publisher · View at Google Scholar · View at Scopus
  47. J. Gama, R. Sebastião, and P. P. Rodrigues, “On evaluating stream learning algorithms,” Machine Learning, vol. 90, no. 3, pp. 317–346, 2013. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  48. F. Wilcoxon, “Individual comparisons by ranking methods,” Biometrics Bulletin, vol. 1, no. 6, pp. 80–83, 1945. View at Publisher · View at Google Scholar
  49. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, WEKA 3: Data Mining Software in Java, 2010, http://www.cs.waikato.ac.nz/ml/weka/.