Table of Contents Author Guidelines Submit a Manuscript
Computational Intelligence and Neuroscience
Volume 2016, Article ID 5423204, 10 pages
http://dx.doi.org/10.1155/2016/5423204
Research Article

Imbalanced Learning Based on Logistic Discrimination

1School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, China
2School of Information Engineering, Zhengzhou University, Zhengzhou 450000, China

Received 10 July 2015; Revised 23 October 2015; Accepted 26 October 2015

Academic Editor: José David Martín-Guerrero

Copyright © 2016 Huaping Guo et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. H. He and E. A. Garcia, “Learning from imbalanced data,” IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 9, pp. 1263–1284, 2009. View at Publisher · View at Google Scholar · View at Scopus
  2. H. He and Y. Ma, Imbalanced Learning: Foundations, Algorithms, and Applications, Wiley-IEEE Press, New York, NY, USA, 2013. View at Publisher · View at Google Scholar
  3. P. Yao, Z. Wang, H. Jiang, and Z. Liu, “Fault diagnosis method based on CS-boosting for unbalanced training data,” Journal of Vibration, Measurement & Diagnosis, vol. 33, no. 1, pp. 111–115, 2013. View at Google Scholar · View at Scopus
  4. X.-Y. Liu, Q.-Q. Li, and Z.-H. Zhou, “Learning imbalanced multi-class data with optimal dichotomy weights,” in Proceedings of the 13th IEEE International Conference on Data Mining (ICDM '13), pp. 478–487, Dallas, Tex, USA, December 2013. View at Publisher · View at Google Scholar · View at Scopus
  5. P. D. Martin, “Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation,” Journal of Machine Learning Technologies, vol. 2, no. 1, pp. 37–63, 2011. View at Google Scholar
  6. X.-Y. Liu, J. X. Wu, and Z.-H. Zhou, “Exploratory under-sampling for class-imbalance learning,” in Proceedings of the 6th International Conference on Data Mining (ICDM '06), pp. 965–969, IEEE, Hong Kong, December 2006. View at Publisher · View at Google Scholar · View at Scopus
  7. N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: synthetic minority over-sampling technique,” Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, 2002. View at Google Scholar · View at Scopus
  8. J. Zhu and E. H. Hovy, “Active learning for word sense disambiguation with methods for addressing the class imbalance problem,” in Proceedings of the Joint Meeting of Conference on Empirical Methods in Natural Language Processing and Conference on Computational Natural Language Learning (EMNLP-CoNLL '07), vol. 7, pp. 783–790, Prague, Czech Republic, June 2007.
  9. S. Ertekin, “Adaptive oversampling for imbalanced data classification,” in Information Sciences and Systems, pp. 261–269, Springer, 2013. View at Publisher · View at Google Scholar
  10. B. X. Wang and N. Japkowicz, “Boosting support vector machines for imbalanced data sets,” Knowledge and Information Systems, vol. 25, no. 1, pp. 1–20, 2010. View at Publisher · View at Google Scholar · View at Scopus
  11. M. V. Joshi, R. C. Agarwal, and V. Kumar, “Mining needles in a haystack: classifying rare classes via two-phase rule induction,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 91–102, Santa Barbara, Calif, USA, May 2001.
  12. G. M. Weiss, “Mining with rarity: a unifying framework,” ACM SIGKDD Explorations Newsletter, vol. 6, no. 1, pp. 7–19, 2004. View at Publisher · View at Google Scholar
  13. A. P. Sinha and J. H. May, “Evaluating and tuning predictive data mining models using receiver operating characteristic curves,” Journal of Management Information Systems, vol. 21, no. 3, pp. 249–280, 2004. View at Google Scholar · View at Scopus
  14. Y. M. Sun, M. S. Kamel, A. K. C. Wong, and Y. Wang, “Cost-sensitive boosting for classification of imbalanced data,” Pattern Recognition, vol. 40, no. 12, pp. 3358–3378, 2007. View at Publisher · View at Google Scholar · View at Scopus
  15. N. Japkowicz and S. Stephen, “The class imbalance problem: a systematic study,” Intelligent Data Analysis, vol. 6, no. 5, pp. 429–450, 2002. View at Google Scholar · View at Scopus
  16. K. P. Murphy, Machine Learning: A Probabilistic Perspective, MIT Press, Cambridge, Mass, USA, 2012.
  17. T. Jo and N. Japkowicz, “Class imbalances versus small disjuncts,” ACM SIGKDD Explorations Newsletter, vol. 6, no. 1, pp. 40–49, 2004. View at Publisher · View at Google Scholar
  18. C. G. Varassin, A. Plastino, H. C. D. G. Leitão, and B. Zadrozny, “Undersampling strategy based on clustering to improve the performance of splice site classification in human genes,” in Proceedings of the 24th International Workshop on Database and Expert Systems Applications (DEXA '13), pp. 85–89, Los Alamitos, Calif, USA, August 2013. View at Publisher · View at Google Scholar
  19. Y. Mi, “Imbalanced classification based on active learning SMOTE,” Research Journal of Applied Sciences, Engineering and Technology, vol. 5, no. 3, pp. 944–949, 2013. View at Google Scholar · View at Scopus
  20. P. Cao, D. Zhao, and O. R. Zaïane, “A PSO-based cost-sensitive neural network for imbalanced data classification,” in Trends and Applications in Knowledge Discovery and Data Mining, vol. 7867 of Lecture Notes in Computer Science, pp. 452–463, Springer, Berlin, Germany, 2013. View at Publisher · View at Google Scholar
  21. R. Alejo, V. Garcia, J. M. Sotoca, R. A. Mollineda, and J. S. Sanchez, “Improving the performance of the RBF neural networks trained with imbalanced samples,” in Computational and Ambient Intelligence, vol. 4507 of Lecture Notes in Computer Science, pp. 162–169, Springer, Berlin, Germany, 2007. View at Publisher · View at Google Scholar
  22. P. E. Gill, W. Murray, and M. H. Wright, Practical Optimization, Academic Press, New York, NY, USA, 1982. View at MathSciNet
  23. P. E. Gill and W. Murray, “Minimization subject to bounds on the variables,” NPL Report NAC72, 1976. View at Google Scholar
  24. E. K. P. Chong and S. H. Zak, An Introduction to Optimization, John Wiley & Sons, 4th edition, 2013.
  25. C. Blake and C. Merz, UCI repository of machine learning databases, http://www.ics.uci.edu/mlearn/MLRepository.html.
  26. J. V. Hulse, T. M. Khoshgoftaar, and A. Napolitano, “Experimental perspectives on learning from imbalanced data,” in Proceedings of the 24th International Conference on Machine Learning (ICML '07), pp. 935–942, Corvallis, Ore, USA, June 2007. View at Publisher · View at Google Scholar · View at Scopus
  27. N. Garcia-Pddrajas, N. C. Garcia-Osorio, and C. Fyfe, “Nonlinear boosting projections for ensemble construction,” Journal of Machine Learning Research, vol. 8, pp. 1–33, 2007. View at Google Scholar
  28. M. A. Maloof, “Learning when data sets are imbalanced and when costs are unequal and unknown,” in Proceedings of the Workshop on Learning from Imbalanced Data Sets II (ICML '03), vol. 2, pp. 1–2, Washington, DC, USA, August 2003.
  29. J. Demsar, “Statistical comparisons of classifiers over multiple data sets,” Journal of Machine Learning Research, vol. 6, pp. 1–30, 2006. View at Google Scholar · View at MathSciNet
  30. S. García, A. Fernández, J. Luengo, and F. Herrera, “Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: experimental analysis of power,” Information Sciences, vol. 180, no. 10, pp. 2044–2064, 2010. View at Publisher · View at Google Scholar · View at Scopus