Table of Contents Author Guidelines Submit a Manuscript
Advances in Bioinformatics
Volume 2016, Article ID 5670851, 6 pages
http://dx.doi.org/10.1155/2016/5670851
Research Article

Feature Selection Has a Large Impact on One-Class Classification Accuracy for MicroRNAs in Plants

1Computer Science, The College of Sakhnin, 30810 Sakhnin, Israel
2The Institute of Applied Research, The Galilee Society, P.O. Box 437, 20200 Shefa Amr, Israel
3Molecular Biology and Genetics, Izmir Institute of Technology, Urla, 35430 Izmir, Turkey
4Bionia Incorporated, IZTEKGEB A8, Urla, 35430 Izmir, Turkey

Received 31 October 2015; Accepted 16 March 2016

Academic Editor: Paul Harrison

Copyright © 2016 Malik Yousef et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. R. C. Lee, R. L. Feinbaum, and V. Ambros, “The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14,” Cell, vol. 75, no. 5, pp. 843–854, 1993. View at Publisher · View at Google Scholar · View at Scopus
  2. V. N. Kim, J. Han, and M. C. Siomi, “Biogenesis of small RNAs in animals,” Nature Reviews Molecular Cell Biology, vol. 10, no. 2, pp. 126–139, 2009. View at Publisher · View at Google Scholar · View at Scopus
  3. K. U. Tüfekci, M. G. Öner, R. L. J. Meuwissen, and Ş. Genç, “The role of microRNAs in human diseases,” Methods in Molecular Biology, vol. 1107, pp. 33–50, 2014. View at Publisher · View at Google Scholar · View at Scopus
  4. E. J. Chapman and J. C. Carrington, “Specialization and evolution of endogenous small RNA pathways,” Nature Reviews Genetics, vol. 8, no. 11, pp. 884–896, 2007. View at Publisher · View at Google Scholar · View at Scopus
  5. B. Alural, G. A. Duran, K. U. Tufekci et al., “EPO mediates neurotrophic, neuroprotective, anti-oxidant, and anti-apoptotic effects via downregulation of miR-451 and miR-885-5p in SH-SY5Y neuron-like cells,” Frontiers in Immunology, vol. 5, article 475, 2014. View at Publisher · View at Google Scholar · View at Scopus
  6. B. Alural, A. Ozerdem, J. Allmer, K. Genc, and S. Genc, “Lithium protects against paraquat neurotoxicity by NRF2 activation and miR-34a inhibition in SH-SY5Y cells,” Frontiers in Cellular Neuroscience, vol. 9, article 209, 2015. View at Publisher · View at Google Scholar · View at Scopus
  7. Z. Zhang, J. Yu, D. Li et al., “PMRD: plant microRNA database,” Nucleic Acids Research, vol. 38, no. 1, Article ID gkp818, pp. D806–D813, 2009. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Griffiths-Jones, H. K. Saini, S. van Dongen, and A. J. Enright, “miRBase: tools for microRNA genomics,” Nucleic Acids Research, vol. 36, no. 1, pp. D154–D158, 2008. View at Publisher · View at Google Scholar · View at Scopus
  9. S.-D. Hsu, F.-M. Lin, W.-Y. Wu et al., “MiRTarBase: a database curates experimentally validated microRNA-target interactions,” Nucleic Acids Research, vol. 39, no. 1, pp. D163–D169, 2011. View at Publisher · View at Google Scholar · View at Scopus
  10. M. Yousef, S. Jung, L. C. Showe, and M. K. Showe, “Learning from positive examples when the negative class is undetermined—microRNA gene identification,” Algorithms for Molecular Biology, vol. 3, article 2, 2008. View at Publisher · View at Google Scholar · View at Scopus
  11. J. Ding, S. Zhou, and J. Guan, “MiRenSVM: towards better prediction of microRNA precursors using an ensemble SVM classifier with multi-loop features,” BMC Bioinformatics, vol. 11, supplement 11, p. S11, 2010. View at Publisher · View at Google Scholar · View at Scopus
  12. Y. Wu, B. Wei, H. Liu, T. Li, and S. Rayner, “MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences,” BMC Bioinformatics, vol. 12, no. 1, article 107, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. W. Ritchie, D. Gao, and J. E. J. Rasko, “Defining and providing robust controls for microRNA prediction,” Bioinformatics, vol. 28, no. 8, pp. 1058–1061, 2012. View at Publisher · View at Google Scholar · View at Scopus
  14. M. D. Saçar and J. Allmer, “Current limitations for computational analysis of miRNAs in cancer,” Pakistan Journal of Clinical and Biomedical Research, vol. 1, no. 2, pp. 3–5, 2013. View at Google Scholar
  15. M. D. Saçar, H. Hamzeiy, and J. Allmer, “Can MiRBase provide positive data for machine learning for the detection of MiRNA hairpins?” Journal of Integrative Bioinformatics, vol. 10, no. 2, article 215, 2013. View at Google Scholar · View at Scopus
  16. M. Yousef, N. Najami, and W. Khalifav, “A comparison study between one-class and two-class machine learning for MicroRNA target detection,” Journal of Biomedical Science and Engineering, vol. 3, no. 3, pp. 247–252, 2010. View at Publisher · View at Google Scholar
  17. M. D. Saçar and J. Allmer, “Machine learning methods for microRNA gene prediction,” Methods in Molecular Biology, vol. 1107, pp. 177–187, 2014. View at Publisher · View at Google Scholar · View at Scopus
  18. M. D. Saçar and J. Allmer, “Comparison of four Ab initio MicroRNA prediction tools,” in Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOINFORMATICS '13), pp. 190–195, Barcelona, Spain, February 2013. View at Scopus
  19. K. L. S. Ng and S. K. Mishra, “De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures,” Bioinformatics, vol. 23, no. 11, pp. 1321–1330, 2007. View at Publisher · View at Google Scholar · View at Scopus
  20. L. Wei, M. Liao, Y. Gao, R. Ji, Z. He, and Q. Zou, “Improved and promising identificationof human microRNAs by incorporatinga high-quality negative set,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 1, pp. 192–201, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. L. M. Manevitz and M. Yousef, “One-class SVMs for document classification,” Journal of Machine Learning Research, vol. 2, pp. 139–154, 2002. View at Google Scholar
  22. L. Manevitz and M. Yousef, “One-class document classification via Neural Networks,” Neurocomputing, vol. 70, no. 7–9, pp. 1466–1481, 2007. View at Publisher · View at Google Scholar · View at Scopus
  23. M. Yousef, J. Allmer, and W. Khalifa, “Accurate plant MicroRNA prediction can be achieved using sequence motif features,” Journal of Intelligent Learning Systems and Applications, vol. 8, no. 1, pp. 9–22, 2016. View at Publisher · View at Google Scholar
  24. M. D. Saçar and J. Allmer, “Data mining for microrna gene prediction: on the impact of class imbalance and feature number for microrna gene prediction,” in Proceedings of the 8th International Symposium on Health Informatics and Bioinformatics (HIBIT '13), pp. 1–6, IEEE, Ankara,Turkey, September 2013. View at Publisher · View at Google Scholar · View at Scopus
  25. E. Amaldi and V. Kann, “On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems,” Theoretical Computer Science, vol. 209, no. 1-2, pp. 237–260, 1998. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  26. S. Paul, M. Magdon-Ismail, and P. Drineas, “Feature selection for linear SVM with provable guarantees,” Journal of Machine Learning Research, vol. 38, pp. 735–743, 2015. View at Google Scholar
  27. I. Guyon, J. Weston, S. Barnhill, and V. Vapnik, “Gene selection for cancer classification using support vector machines,” Machine Learning, vol. 46, no. 1–3, pp. 389–422, 2002. View at Publisher · View at Google Scholar · View at Scopus
  28. M. E. Ahsen, N. K. Singh, T. Boren, M. Vidyasagar, and M. A. White, “A new feature selection algorithm for two-class classification problems and application to endometrial cancer,” in Proceedings of the 51st IEEE Conference on Decision and Control (CDC '12), pp. 2976–2982, Maui, Hawaii, USA, December 2012. View at Publisher · View at Google Scholar · View at Scopus
  29. L. H. N. Lorena, A. C. P. L. F. Carvalho, and A. C. Lorena, “Filter feature selection for one-class classification,” Journal of Intelligent & Robotic Systems, vol. 80, supplement 1, pp. 227–243, 2015. View at Publisher · View at Google Scholar
  30. P. Xuan, M. Guo, Y. Huang, W. Li, and Y. Huang, “Maturepred: efficient identification of microRNAs within novel plant Pre-miRNAs,” PLoS ONE, vol. 6, no. 11, Article ID e27422, 2011. View at Publisher · View at Google Scholar · View at Scopus
  31. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software,” ACM SIGKDD Explorations Newsletter, vol. 11, no. 1, pp. 10–18, 2009. View at Publisher · View at Google Scholar
  32. P. Xuan, M. Guo, X. Liu, Y. Huang, W. Li, and Y. Huang, “PlantMiRNAPred: efficient classification of real and pseudo plant pre-miRNAs,” Bioinformatics, vol. 27, no. 10, pp. 1368–1376, 2011. View at Publisher · View at Google Scholar · View at Scopus
  33. M. Yousef, J. Allmer, and W. Khalifa, “Sequence motif-based one-class classifiers can achieve comparable accuracy to two-class learners for plant microRNA detection,” Journal of Biomedical Science and Engineering, vol. 8, no. 10, pp. 684–694, 2015. View at Publisher · View at Google Scholar
  34. M. D. Saçar, C. Bağcı, and J. Allmer, “Computational prediction of MicroRNAs from toxoplasma gondii potentially regulating the hosts' gene expression,” Genomics, Proteomics and Bioinformatics, vol. 12, no. 5, pp. 228–238, 2014. View at Publisher · View at Google Scholar · View at Scopus
  35. D. M. J. Tax, DDTools, the Data Description Toolbox for Matlab, 2015.
  36. M. R. Berthold, N. Cebron, F. Dill et al., “KNIME: the Konstanz Information Miner,” in Data Analysis, Machine Learning and Applications, C. Preisach, H. Burkhardt, L. Schmidt-Thieme, and R. Decker, Eds., Studies in Classification, Data Analysis, and Knowledge Organization, pp. 319–326, 2008. View at Publisher · View at Google Scholar
  37. I. Letunic and P. Bork, “Interactive Tree of Life v2: online annotation and display of phylogenetic trees made easy,” Nucleic Acids Research, vol. 39, supplement 2, pp. W475–W478, 2011. View at Publisher · View at Google Scholar · View at Scopus
  38. P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu, “MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features,” Nucleic Acids Research, vol. 35, no. 2, pp. W339–W344, 2007. View at Publisher · View at Google Scholar · View at Scopus
  39. R. Batuwita and V. Palade, “microPred: effective classification of pre-miRNAs for human miRNA gene prediction,” Bioinformatics, vol. 25, no. 8, pp. 989–995, 2009. View at Publisher · View at Google Scholar · View at Scopus
  40. J. Allmer, “Computational and bioinformatics methods for microRNA gene prediction,” Methods in Molecular Biology, vol. 1107, pp. 157–175, 2014. View at Publisher · View at Google Scholar · View at Scopus
  41. Q. Zou, J. Zeng, L. Cao, and R. Ji, “A novel features ranking metric with application to scalable visual and bioinformatics data classification,” Neurocomputing, vol. 173, part 2, pp. 346–354, 2016. View at Publisher · View at Google Scholar · View at Scopus
  42. X. Zeng, Y. Liao, Y. Liu, and Q. Zou, “Prediction and validation of disease genes using HeteSim Scores,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2016. View at Publisher · View at Google Scholar