Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2013 (2013), Article ID 875450, 9 pages
http://dx.doi.org/10.1155/2013/875450
Research Article

Iterative Nearest Neighborhood Oversampling in Semisupervised Learning from Imbalanced Data

School of Software, Dalian University of Technology, Dalian 116620, China

Received 12 May 2013; Accepted 4 June 2013

Academic Editors: P. Melin and J. Pavón

Copyright © 2013 Fengqi Li et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. O. Chapelle, B. Schökopf, and A. Zien, Eds., Semi-Supervised Learning, MIT Press, Cambridge, Mass, USA, 2006.
  2. S. Li, Z. Wang, G. Zhou, and S. Y. M. Lee, “Semi-supervised learning for imbalanced sentiment classification,” in Proceedings of the 22nd International Joint Conference on Artificial Intelligence, vol. 3, pp. 1826–1831, AAAI Press, Barcelona, Spain, 2011.
  3. L. Zhuang and H. Dai, “Parameter estimation of one-class SVM on imbalance text classification,” Advances in Artificial Intelligence, vol. 4013, pp. 538–549, 2006. View at Google Scholar
  4. C. Phua, V. Lee, K. Smith, and R. Gayler, “A comprehensive survey of data mining-based fraud detection research,” Artificial Intelligence Review, 2005. View at Google Scholar
  5. R. Sommer and V. Paxson, “Outside the closed world: on using machine learning for network intrusion detection,” in Proceedings of the 31st IEEE Symposium on Security and Privacy (SP '10), pp. 305–316, IEEE CS Press, May 2010. View at Publisher · View at Google Scholar · View at Scopus
  6. P. Radivojac, N. V. Chawla, A. K. Dunker, and Z. Obradovic, “Classification and knowledge discovery in protein databases,” Journal of Biomedical Informatics, vol. 37, no. 4, pp. 224–239, 2004. View at Publisher · View at Google Scholar · View at Scopus
  7. P. Chan and S. Stolfo, “Toward scalable learning with non-uniform class and cost distributions,” in Proceedings of the International Conference on Knowledge Discovery and Data Mining, pp. 164–168, 1998.
  8. A. Estabrooks, T. Jo, and N. Japkowicz, “A multiple resampling method for learning from imbalanced data sets,” Computational Intelligence, vol. 20, no. 1, pp. 18–36, 2004. View at Google Scholar · View at Scopus
  9. K. M. Ting, “An instance-weighting method to induce cost-sensitive trees,” IEEE Transactions on Knowledge and Data Engineering, vol. 14, no. 3, pp. 659–665, 2002. View at Publisher · View at Google Scholar · View at Scopus
  10. X. Hong, S. Chen, and C. J. Harris, “A kernel-based two-class classifier for imbalanced data sets,” IEEE Transactions on Neural Networks, vol. 18, no. 1, pp. 28–41, 2007. View at Publisher · View at Google Scholar · View at Scopus
  11. M. Li, H. Zhang, R. Wu, and Z.-H. Zhou, “Sample-based software defect prediction with active and semi-supervised learning,” Automated Software Engineering, vol. 19, no. 2, pp. 201–230, 2012. View at Publisher · View at Google Scholar · View at Scopus
  12. S. Ertekin, J. Huang, L. Bottou, and C. Lee Giles, “Learning on the border: active learning in imbalanced data classification,” in Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM '07), pp. 127–136, November 2007. View at Publisher · View at Google Scholar · View at Scopus
  13. S. Barua, M. Isam, X. Yao, and K. Murase, “MWMOTE—majority weighted minority oversampling technique for imbalaced data set learning,” IEEE Transactions on Knowledge and Data Engineering, 2012. View at Google Scholar
  14. N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: synthetic minority over-sampling technique,” Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, 2002. View at Google Scholar · View at Scopus
  15. X.-Y. Liu, J. Wu, and Z.-H. Zhou, “Exploratory under-sampling for class-imbalance learning,” in Proceedings of the 6th International Conference on Data Mining (ICDM '06), pp. 965–969, December 2006. View at Publisher · View at Google Scholar · View at Scopus
  16. J. Wang, T. Jebara, and S.-F. Chang, “Graph transduction via alternating minimization,” in Proceedings of the 25th International Conference on Machine Learning, pp. 1144–1151, July 2008. View at Scopus
  17. J. Wang, Y.-G. Jiang, and S.-F. Chang, “Label diagnosis through self tuning for web image search,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR '09), pp. 1390–1397, June 2009. View at Publisher · View at Google Scholar · View at Scopus
  18. M. Plessis and M. Sugiyama, “Semi-supervised learning of class balance under class-prior change by distribution matching,” in Proceedings of the International Conference on Machine Learning, pp. 823–830, 2012.
  19. W. Liu, J. Wang, and S.-F. Chang, “Robust and scalable graph-based semisupervised learning,” Proceedings of the IEEE, vol. 100, no. 9, pp. 2624–2638, 2012. View at Google Scholar
  20. X. Zhu, Z. Ghahramani, and J. Lafferty, “Semi-supervised learning using Gaussian fields and harmonic functions,” in Proceedings of the 20th International Conference on Machine Learning, pp. 912–919, August 2003. View at Scopus
  21. D. Zhou, O. Bousquet, T. N. Lal, J. Weston, and B. Schölkopf, “Learning with local and global consistency,” in Advances in Neural Information Processing Systems, vol. 16, pp. 321–328, MIT Press, Cambridge, Mass, USA, 2004. View at Google Scholar
  22. M. Belkin, P. Niyogi, and V. Sindhwani, “Manifold regularization: a geometric framework for learning from labeled and unlabeled examples,” Journal of Machine Learning Research, vol. 7, pp. 2399–2434, 2006. View at Google Scholar · View at Scopus
  23. D. Tao, C. Xu, L. Yang, and X. S. Hua, “Ensemble manifold regularization,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, pp. 1227–1223, 2012. View at Google Scholar
  24. Y. Yang, F. Nie, D. Xu, J. Luo, Y. Zhuang, and Y. Pan, “A multimedia retrieval framework based on semi-supervised ranking and relevance feedback,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 4, pp. 723–742, 2012. View at Publisher · View at Google Scholar · View at Scopus