Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2017 (2017), Article ID 3020627, 17 pages
https://doi.org/10.1155/2017/3020627
Research Article

Robustification of Naïve Bayes Classifier and Its Application for Microarray Gene Expression Data Analysis

1Lab of Bioinformatics, Department of Statistics, University of Rajshahi, Rajshahi 6205, Bangladesh
2Department of Statistics, Begum Rokeya University, Rangpur, Rangpur 5400, Bangladesh

Correspondence should be addressed to Md. Shakil Ahmed

Received 18 March 2017; Revised 10 June 2017; Accepted 14 June 2017; Published 7 August 2017

Academic Editor: Federico Ambrogi

Copyright © 2017 Md. Shakil Ahmed et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. V. Veer, J. Laura et al., “Gene expression profiling predicts clinical outcome of breast cancer,” Nature, vol. 415, no. 6871, pp. 530–536, 2002. View at Publisher · View at Google Scholar · View at Scopus
  2. S. Dam, U. Võsa, A. van der Graaf, L. Franke, and J. P. de Magalhães, “Gene co-expression analysis for functional classification and gene–disease predictions,” Briefings in Bioinformatics, p. bbw139, 2017. View at Publisher · View at Google Scholar
  3. X. Yu, G. Yu, J. Wang, and G. N. Brock, “Clustering cancer gene expression data by projective clustering ensemble,” PLOS ONE, vol. 12, no. 2, Article ID e0171429, 2017. View at Publisher · View at Google Scholar
  4. R. K. Singh and M. Sivabalakrishnan, “Feature Selection of Gene Expression Data for Cancer Classification: A Review,” Procedia Computer Science, vol. 50, pp. 52–57, 2015. View at Google Scholar
  5. P. W. Novianti, V. L. Jong, K. C. Roes, and M. J. Eijkemans, “Meta-analysis approach as a gene selection method in class prediction: does it improve model performance? A case study in acute myeloid leukemia,” BMC Bioinformatics, vol. 18, no. 1, 2017. View at Publisher · View at Google Scholar
  6. M. Chen, K. Li, H. Li, C. Song, and Y. Miao, “The Glutathione Peroxidase Gene Family in Gossypium hirsutum: Genome-Wide Identification, Classification, Gene Expression and Functional Analysis,” Scientific Reports, vol. 7, 2017. View at Publisher · View at Google Scholar
  7. V. L. Jong, P. W. Novianti, K. C. B. Roes, and M. J. C. Eijkemans, “Selecting a classification function for class prediction with gene expression data,” Bioinformatics, vol. 32, no. 12, pp. 1814–1822, 2016. View at Publisher · View at Google Scholar · View at Scopus
  8. K. Buza, “Classification of gene expression data: a hubness-aware semi-supervised approach,” Computer Methods and Programs in Biomedicine, vol. 127, pp. 105–113, 2016. View at Publisher · View at Google Scholar
  9. L. Wang, W. K. Oh, and J. Zhu, “Disease-specific classification using deconvoluted whole blood gene expression,” Scientific Reports, vol. 6, Article ID 32976, 2016. View at Publisher · View at Google Scholar · View at Scopus
  10. L. Jiang, H. Chen, L. Pinello, and G.-C. Yuan, “GiniClust: Detecting rare cell types from single-cell gene expression data with Gini index,” Genome Biology, vol. 17, no. 1, article no. 144, 2016. View at Publisher · View at Google Scholar · View at Scopus
  11. T. R. Golub, D. K. Slonim, P. Tamayo et al., “Molecular classification of cancer: class discovery and class prediction by gene expression monitoring,” Science, vol. 286, no. 5439, pp. 531–527, 1999. View at Publisher · View at Google Scholar · View at Scopus
  12. D. Soria, J. M. Garibaldi, F. Ambrogi, E. M. Biganzoli, and I. O. Ellis, “A 'non-parametric' version of the naive Bayes classifier,” Knowledge-Based Systems, vol. 24, no. 6, pp. 775–784, 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. G. W. Wright and R. M. Simon, “A random variance model for detection of differential gene expression in small microarray experiments,” Bioinformatics, vol. 19, no. 18, pp. 2448–2455, 2003. View at Publisher · View at Google Scholar · View at Scopus
  14. L. Jiang, Z. Cai, D. Wang, and H. Zhang, “Improving Tree augmented Naive Bayes for class probability estimation,” Knowledge-Based Systems, vol. 26, pp. 239–245, 2012. View at Publisher · View at Google Scholar · View at Scopus
  15. A. A. Balamurugan, R. Rajaram, S. Pramala, S. Rajalakshmi, C. Jeyendran, and J. Dinesh Surya Prakash, “NB+: An improved Naïve Bayesian algorithm,” Knowledge-Based Systems, vol. 24, no. 5, pp. 563–569, 2011. View at Publisher · View at Google Scholar · View at Scopus
  16. M. N. H. Mollah, M. Minami, and S. Eguchi, “Exploring latent structure of mixture ICA models by the minimum β-divergence method,” Neural Computation, vol. 18, no. 1, pp. 166–190, 2006. View at Publisher · View at Google Scholar · View at Scopus
  17. M. N. H. Mollah, S. Eguchi, and M. Minami, “Robust prewhitening for ICA by minimizing β-divergence and its application to FastICA,” Neural Processing Letters, vol. 25, no. 2, pp. 91–110, 2007. View at Publisher · View at Google Scholar · View at Scopus
  18. M. Nurul Haque Mollah, N. Sultana, M. Minami, and S. Eguchi, “Robust extraction of local structures by the minimum β-divergence method,” Neural Networks, vol. 23, no. 2, pp. 226–238, 2010. View at Publisher · View at Google Scholar · View at Scopus
  19. R. H. Randles, J. D. Broffitt, J. S. Ramberg, and R. V. Hogg, “Generalized linear and quadratic discriminant functions using robust estimates,” Journal of the American Statistical Association, vol. 73, no. 363, pp. 564–568, 1978. View at Publisher · View at Google Scholar · View at Scopus
  20. R. A. Maronna, “Robust M-estimators of multivariate location and scatter,” The Annals of Statistics, vol. 4, no. 1, pp. 51–67, 1976. View at Publisher · View at Google Scholar · View at MathSciNet
  21. V. Todorov and P. Neykov, “Robust selection of variables in the discriminant analysis based on the mve and mcd estimators,” in Proceedings of the Computational Statistics, COMPAST, Physica Verlag, Heidelberg, deu, 1990.
  22. V. Todorov, N. Neykov, and P. Neytchev, “Robust two-group discrimination by bounded influence regression. A Monte Carlo simulation,” Computational Statistics and Data Analysis, vol. 17, no. 3, pp. 289–302, 1994. View at Publisher · View at Google Scholar · View at Scopus
  23. V. Todorov and A. M. Pires, “Comparative performance of several robust linear discriminant analysis methods,” REVSTAT Statistical Journal, vol. 5, no. 1, pp. 63–83, 2007. View at Google Scholar · View at MathSciNet
  24. X. He and W. K. Fung, “High breakdown estimation for multiple populations with applications to discriminant analysis,” Journal of Multivariate Analysis, vol. 72, no. 2, pp. 151–162, 2000. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  25. M. Hubert and K. Van Driessen, “Fast and robust discriminant analysis,” Computational Statistics & Data Analysis, vol. 45, no. 2, pp. 301–320, 2004. View at Publisher · View at Google Scholar · View at MathSciNet
  26. D. M. Hawkins and G. J. McLachlan, “High-breakdown linear discriminant analysis,” Journal of the American Statistical Association, vol. 92, no. 437, pp. 136–143, 1997. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  27. C. Devi Arockia Vanitha, D. Devaraj, and M. Venkatesulu, “Gene expression data classification using Support Vector Machine and mutual information-based gene selection,” Procedia Computer Science, vol. 47, pp. 13–21, 2015. View at Publisher · View at Google Scholar · View at Scopus
  28. R. M. Parry, W. Jones, T. H. Stokes et al., “K-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction,” Pharmacogenomics Journal, vol. 10, no. 4, pp. 292–309, 2010. View at Publisher · View at Google Scholar · View at Scopus
  29. P. M. Long and V. B. Vega, “Boosting and microarray data,” Machine Learning, vol. 52, no. 1-2, pp. 31–44, 2003. View at Publisher · View at Google Scholar · View at Scopus
  30. N. Friedman, D. Geiger, and M. Goldszmidt, “Bayesian network classifiers,” Machine Learning, vol. 29, no. 2-3, pp. 131–163, 1997. View at Publisher · View at Google Scholar · View at Scopus
  31. M. M. H. Mollah, N. H. Mollah, and H. Kishino, “β-empirical Bayes inference and model diagnosis of microarray data,” BMC Bioinformatics, vol. 13, no. 1, p. 135, 2012. View at Publisher · View at Google Scholar · View at Scopus
  32. G. Nowak and R. Tibshirani, “Complementary hierarchical clustering,” Biostatistics, vol. 9, no. 3, pp. 467–483, 2008. View at Publisher · View at Google Scholar · View at Scopus
  33. M. A. Kuriakose, W. T. Chen, Z. M. He et al., “Selection and validation of differentially expressed genes in head and neck cancer,” Cellular and Molecular Life Sciences, vol. 61, no. 11, pp. 1372–1383, 2004. View at Publisher · View at Google Scholar · View at Scopus
  34. T. L. Bergemann and J. Wilson, “Proportion statistics to detect differentially expressed genes: A comparison with log-ratio statistics,” BMC Bioinformatics, vol. 12, article no. 228, 2011. View at Publisher · View at Google Scholar · View at Scopus
  35. R. Gottardo, Bridge: Bayesian Robust Inference for Differential Gene Expression, R package version 1.40.0, 2017.
  36. T. W. Anderson and D. A. Darling, “Asymptotic theory of certain goodness of fit criteria based on stochastic processes,” Annals of Mathematical Statistics, vol. 23, pp. 193–212, 1952. View at Publisher · View at Google Scholar · View at MathSciNet
  37. R. Thomas, L. de la Torre, X. Chang, and S. Mehrotra, “Validation and characterization of DNA microarray gene expression data distribution and associated moments,” BMC Bioinformatics, vol. 11, article no. 576, 2010. View at Publisher · View at Google Scholar · View at Scopus
  38. J. Hausser and K. Strimmer, “Entropy inference and the James-Stein estimator, with application to nonlinear gene association networks,” Journal of Machine Learning Research (JMLR), vol. 10, pp. 1469–1484, 2009. View at Google Scholar · View at MathSciNet
  39. G. E. P. Box and D. R. Cox, “An analysis of transformations,” Journal of the Royal Statistical Society. Series B, vol. 26, pp. 211–252, 1964. View at Google Scholar · View at MathSciNet