Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2015 (2015), Article ID 821798, 9 pages
http://dx.doi.org/10.1155/2015/821798
Research Article

An Efficient Feature Subset Selection Algorithm for Classification of Multidimensional Dataset

1Department of Computer Science and Engineering, University College of Engineering, Anna University, Tiruchirappalli, Tamil Nadu, India
2Department of Mathematics, College of Engineering, Anna University, Tamil Nadu, India

Received 2 June 2015; Revised 14 August 2015; Accepted 20 August 2015

Academic Editor: Juan M. Corchado

Copyright © 2015 Senthilkumar Devaraj and S. Paulraj. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. Read, “A pruned problem transformation method for multi-label classification,” in Proceedings of the 6th New Zealand Computer Science Research Student Conference (NZCSRSC '08), pp. 143–150, April 2008. View at Scopus
  2. M. S. Mohamad, S. Deris, S. M. Yatim, and M. R. Othman, “Feature selection method using genetic algorithm for the classification of small and high dimension data,” in Proceedings of the 1st International Symposium on Information and Communication Technology, pp. 1–4, 2004.
  3. D. Zhang, S. Chen, and Z.-H. Zhou, “Constraint score: a new filter method for feature selection with pairwise constraints,” Pattern Recognition, vol. 41, no. 5, pp. 1440–1451, 2008. View at Publisher · View at Google Scholar · View at Scopus
  4. J. Grande, M. del Rosario Suarez, and J. R. Villar, “A feature selection method using a fuzzy mutual information measure,” in Innovations in Hybrid Intelligent Systems, vol. 44 of Advances in Soft Computing, pp. 56–63, Springer, Berlin, Germany, 2007. View at Publisher · View at Google Scholar
  5. M.-L. Zhang and Z.-H. Zhou, “A review on multi-label learning algorithms,” IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 8, pp. 1819–1837, 2014. View at Publisher · View at Google Scholar · View at Scopus
  6. S. Jungjit, M. Michaelis, A. A. Freitas, and J. Cinatl, “Extending multi-label feature selection with KEGG pathway information for microarray data analysis,” in Proceedings of the IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB '14), pp. 1–8, May 2014. View at Publisher · View at Google Scholar · View at Scopus
  7. H. Lui and H. Motoda, Feature Selection for Knowledge Discovery and Data Mining, Kluwer Academic, Norwell, Mass, USA, 1998.
  8. Y. Guo and W. Xue, “Probabilistic multi-label classification with sparse feature learning,” in Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI '13), pp. 1373–1379, August 2013. View at Scopus
  9. N. Spolaôr and G. Tsoumakas, “Evaluating feature selection methods for multi-label text classification,” in Proceedings of the BioASQ Workshop, pp. 1–12, Valencia, Spain, 2013.
  10. N. Spolaôr, E. A. Cherman, M. C. Monard, and H. D. Lee, “A comparison of multi-label feature selection methods using the problem transformation approach,” Electronic Notes in Theoretical Computer Science, vol. 292, pp. 135–151, 2013. View at Publisher · View at Google Scholar · View at Scopus
  11. G. Forman, “An extensive empirical study of feature selection metrics for text classification,” Journal of Machine Learning Research, vol. 3, pp. 1289–1305, 2003. View at Google Scholar · View at Scopus
  12. H. Guo and S. Létourneau, “Iterative classification for multiple target attributes,” Journal of Intelligent Information Systems, vol. 40, no. 2, pp. 283–305, 2013. View at Publisher · View at Google Scholar · View at Scopus
  13. K. Trohidis, G. Tsoumakas, G. Kalliris, and I. Vlahavas, “Multi-label classification of music into emotions,” in Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR '08), pp. 325–330, September 2008. View at Scopus
  14. H. Vafaie and I. F. Imam, “Feature selection methods: genetic algorithms vs. greedy-like search,” in Proceedings of the 3rd International Fuzzy Systems and Intelligent Control Conference, Louisville, Ky, USA, March 1994.
  15. G. Doquire and M. Verleysen, “Feature selection for multi-label classification problems,” in Advances in Computational Intelligence, vol. 6691 of Lecture Notes in Computer Science, pp. 9–16, Springer, Berlin, Germany, 2011. View at Publisher · View at Google Scholar
  16. G.-P. Liu, J.-J. Yan, Y.-Q. Wang et al., “Application of multilabel learning using the relevant feature for each label in chronic gastritis syndrome diagnosis,” Evidence-Based Complementary and Alternative Medicine, vol. 2012, Article ID 135387, 9 pages, 2012. View at Publisher · View at Google Scholar · View at Scopus
  17. M.-L. Zhang, J. M. Peña, and V. Robles, “Feature selection for multi-label naive Bayes classification,” Information Sciences, vol. 179, no. 19, pp. 3218–3229, 2009. View at Publisher · View at Google Scholar · View at Scopus
  18. C.-R. Jiang, C.-C. Liu, X. J. Zhou, and H. Huang, “Optimal ranking in multi-label classification using local precision rates,” Statistica Sinica, vol. 24, no. 4, pp. 1547–1570, 2014. View at Publisher · View at Google Scholar · View at MathSciNet
  19. Y. Yang and S. Gopal, “Multilabel classification with meta-level features in a learning-to-rank framework,” Machine Learning, vol. 88, no. 1-2, pp. 47–68, 2012. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet · View at Scopus
  20. C. Shi, X. Kong, S. Y. Philip, and B. Wang, “Multi-objective multi-label classification,” in Proceedings of the SDM, SIAM Data Mining Conference, pp. 355–366, SIAM, 2012. View at Publisher · View at Google Scholar
  21. E. Spyromitros-Xioufis, G. Tsoumakas, W. Groves, and I. Vlahavas, “Multi-label classification methods for multi-target regression,” http://arxiv.org/abs/1211.6581.
  22. S. Sangsuriyun, S. Marukatat, and K. Waiyamai, “Hierarchical multi-label associative classification (HMAC) using negative rules,” in Proceedings of the 9th IEEE International Conference on Cognitive Informatics (ICCI '10), pp. 919–924, July 2010. View at Publisher · View at Google Scholar · View at Scopus
  23. S. Zhu, X. Ji, W. Xu, and Y. Gong, “Multi-labelled classification using maximum entropy method,” in Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '05), pp. 274–281, 2005.
  24. K. Rangra and K. L. Bansal, “Comparative study of data mining tools,” International Journal of Advanced Research in Computer Science and Software Engineering, vol. 4, no. 6, pp. 216–223, 2014. View at Google Scholar
  25. T. Asha, S. Natarajan, and K. N. B. Murthy, “Data mining techniques in the diagnosis of tuberculosis,” in Understanding Tuberculosis—Global Experiences and Innovative Approaches to the Diagnosis, P.-J. Cardona, Ed., chapter 16, pp. 333–353, InTech, Rijeka, Croatia, 2012. View at Publisher · View at Google Scholar
  26. S. S. Baskar, Dr. L. Arockiam, and S. Charles, “Systematic approach on data pre-processing in data mining,” International Journal of Advanced Computer Technology, vol. 2, no. 11, pp. 335–339, 2013. View at Google Scholar
  27. X.-Y. Zhou and J. S. Lim, “EM algorithm with GMM and naive Bayesian to implement missing values,” in Advanced Science and Technology Letters, vol. 46 of Mobile and Wireless, pp. 1–5, 2014. View at Google Scholar
  28. I. Iguyon and A. Elisseeff, “An introduction to variable and feature selection,” Journal of Machine Learning Research, vol. 3, pp. 1157–1182, 2003. View at Google Scholar · View at Scopus
  29. H. Liu and L. Yu, “Toward integrating feature selection algorithms for classification and clustering,” IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 4, pp. 491–502, 2005. View at Publisher · View at Google Scholar · View at Scopus
  30. M. A. Hall, Correlation-based feature selection for machine learning [Ph.D. thesis], 1999.
  31. http://ftp.ics.uci.edu/pub/machine-learning-databases/thyroid-disease/thyroid0387.names.
  32. http://ftp.ics.uci.edu/pub/machine-learning-databases.
  33. V. Gjorgjioski, D. Kocev, and S. Džeroski, “Comparison of distances for multi-label classification with PCTs,” in Proceedings of the Slovenian KDD Conference on Data Mining and Data Warehouses (SiKDD '11), 2011.
  34. J. Liu, S. Ranka, and T. Kahveci, “Classification and feature selection algorithms for multi-class CGH data,” Bioinformatics, vol. 24, no. 13, pp. i86–i95, 2008. View at Publisher · View at Google Scholar · View at Scopus
  35. J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, 2nd edition, 2006.
  36. S. Godbole and S. Sarawagi, “Discriminative methods for multi-labeled classification,” in Advances in Knowledge Discovery and Data Mining, vol. 3056 of Lecture Notes in Computer Science, pp. 22–30, Springer, Berlin, Germany, 2004. View at Publisher · View at Google Scholar
  37. J. Read, C. Bielza, and P. Larranaga, “Multi-dimensional classification with super-classes,” IEEE Transactions on Knowledge and Data Engineering, vol. 26, no. 7, pp. 1720–1733, 2014. View at Publisher · View at Google Scholar · View at Scopus
  38. G. Madjarov, D. Kocev, D. Gjorgjevikj, and S. Džeroski, “An extensive experimental comparison of methods for multi-label learning,” Pattern Recognition, vol. 45, no. 9, pp. 3084–3104, 2012. View at Publisher · View at Google Scholar · View at Scopus
  39. K. Gao, T. Khoshgoftaar, and J. Van Hulse, “An evaluation of sampling on filter-based feature selection methods,” in Proceedings of the 23rd International Florida Artificial Intelligence Research Society Conference (FLAIRS '10), pp. 416–421, May 2010. View at Scopus
  40. T. M. Khoshgoftaar and K. Gao, “Feature selection with imbalanced data for software defect prediction,” in Proceedings of the 8th International Conference on Machine Learning and Applications (ICMLA '09), pp. 235–240, IEEE, Miami Beach, Fla, USA, December 2009. View at Publisher · View at Google Scholar · View at Scopus
  41. H. Wang, T. M. Khoshgoftaar, and K. Gao, “A comparative study of filter-based feature ranking techniques,” in Proceedings of the 11th IEEE International Conference on Information Reuse and Integration (IRI '10), pp. 43–48, August 2010. View at Publisher · View at Google Scholar · View at Scopus
  42. J. Novaković, P. Strbac, and D. Bulatović, “Toward optimal feature selection using ranking methods and classification algorithms,” Yugoslav Journal of Operations Research, vol. 21, no. 1, pp. 119–135, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  43. J. D. Evans, Straightforward Statistics for the Behavioral Sciences, Brooks/Cole Publishing, Pacific Grove, Calif, USA, 1996.
  44. N. Spolaôr, M. C. Monard, and H. D. Lee, “A systematic review to identify feature selection publications in multi-labeled data,” ICMC Technical Report 374, Universidade de São Paulo, São Paulo, Brazil, 2012. View at Google Scholar