Table of Contents Author Guidelines Submit a Manuscript
BioMed Research International
Volume 2017, Article ID 2437608, 11 pages
https://doi.org/10.1155/2017/2437608
Research Article

Metabolomic Biomarker Identification in Presence of Outliers and Missing Values

1Bioinformatics Lab, Department of Statistics, Rajshahi University, Rajshahi, Bangladesh
2Department of Statistics, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj, Bangladesh
3Department of Statistics, Begum Rokeya University, Rangpur, Bangladesh
4Institute of Biological Sciences, Rajshahi University, Rajshahi, Bangladesh

Correspondence should be addressed to Nishith Kumar; moc.liamg@90urb.kn

Received 24 October 2016; Revised 15 December 2016; Accepted 18 January 2017; Published 14 February 2017

Academic Editor: Peter J. Oefner

Copyright © 2017 Nishith Kumar et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. A. Alaiya, M. Al-Mohanna, and S. Linder, “Clinical cancer proteomics: promises and pitfalls,” Journal of Proteome Research, vol. 4, no. 4, pp. 1213–1222, 2005. View at Publisher · View at Google Scholar · View at Scopus
  2. A. D. Westont and L. Hood, “Systems biology, proteomics, and the future of health care: toward predictive, preventative, and personalized medicine,” Journal of Proteome Research, vol. 3, no. 2, pp. 179–196, 2004. View at Publisher · View at Google Scholar · View at Scopus
  3. K. Jung, T. Friede, and T. Beißbarth, “Reporting FDR analogous confidence intervals for the log fold change of differentially expressed genes,” BMC Bioinformatics, vol. 12, no. 1, article 288, 2011. View at Publisher · View at Google Scholar · View at Scopus
  4. W. Li, “Volcano plots in analyzing differential expressions with mRNA microarrays,” Journal of Bioinformatics and Computational Biology, vol. 10, no. 6, Article ID 1231003, 2012. View at Publisher · View at Google Scholar · View at Scopus
  5. S. Zhang and J. Cao, “A close examination of double filtering with fold change and t test in microarray analysis,” BMC Bioinformatics, vol. 10, article 402, 2009. View at Publisher · View at Google Scholar · View at Scopus
  6. M. Gevrey, I. Dimopoulos, and S. Lek, “Review and comparison of methods to study the contribution of variables in artificial neural network models,” Ecological Modelling, vol. 160, no. 3, pp. 249–264, 2003. View at Publisher · View at Google Scholar · View at Scopus
  7. A. Rakotomamonjy, “Variable selection using SVM-based criteria,” Journal of Machine Learning Research, vol. 3, no. 7-8, pp. 1357–1370, 2003. View at Publisher · View at Google Scholar · View at MathSciNet
  8. A. B. Ishak and B. Ghattas, “An efficient method for variable selection using svm-based criteria,” Journal of Machine Learning Research, vol. 6, pp. 1357–1370, 2005. View at Google Scholar
  9. T. Aittokallio, “Dealing with missing values in large-scale studies: microarray data imputation and beyond,” Briefings in Bioinformatics, vol. 11, no. 2, pp. 253–264, 2009. View at Publisher · View at Google Scholar · View at Scopus
  10. H. Park, T. Shimamura, S. Miyano, and S. Imoto, “Robust prediction of anti-cancer drug sensitivity and sensitivity-specific biomarker,” PLoS ONE, vol. 9, no. 10, Article ID e108990, 2014. View at Publisher · View at Google Scholar · View at Scopus
  11. P. Gromski, Y. Xu, H. Kotze et al., “Influence of missing values substitutes on multivariate analysis of metabolomics data,” Metabolites, vol. 4, no. 2, pp. 433–452, 2014. View at Publisher · View at Google Scholar
  12. O. Hrydziuszko and M. R. Viant, “Missing values in mass spectrometry based metabolomics: an undervalued step in the data processing pipeline,” Metabolomics, vol. 8, no. 1, pp. 161–174, 2012. View at Publisher · View at Google Scholar · View at Scopus
  13. J. L. Schafer and J. W. Graham, “Missing data: our view of the state of the art,” Psychological Methods, vol. 7, no. 2, pp. 147–177, 2002. View at Publisher · View at Google Scholar · View at Scopus
  14. L. Blanchet and A. Smolinska, “Data fusion in metabolomics and proteomics for biomarker discovery,” Methods in Molecular Biology, vol. 1362, pp. 209–223, 2016. View at Publisher · View at Google Scholar
  15. O. Troyanskaya, M. Cantor, G. Sherlock et al., “Missing value estimation methods for DNA microarrays,” Bioinformatics, vol. 17, no. 6, pp. 520–525, 2001. View at Publisher · View at Google Scholar · View at Scopus
  16. D. J. Stekhoven and P. Bühlmann, “Missforest—Non-parametric missing value imputation for mixed-type data,” Bioinformatics, vol. 28, no. 1, pp. 112–118, 2012. View at Publisher · View at Google Scholar · View at Scopus
  17. J. Xia, N. Psychogios, N. Young, and D. S. Wishart, “MetaboAnalyst: a web server for metabolomic data analysis and interpretation,” Nucleic Acids Research, vol. 37, supplement 2, pp. W652–W660, 2009. View at Publisher · View at Google Scholar · View at Scopus
  18. W. Stacklies, H. Redestig, M. Scholz, D. Walther, and J. Selbig, “pcaMethods—a bioconductor package providing PCA methods for incomplete data,” Bioinformatics, vol. 23, no. 9, pp. 1164–1167, 2007. View at Publisher · View at Google Scholar · View at Scopus
  19. S. Oba, M.-A. Sato, I. Takemasa, M. Monden, K.-I. Matsubara, and S. Ishii, “A Bayesian missing value estimation method for gene expression profile data,” Bioinformatics, vol. 19, no. 16, pp. 2088–2096, 2003. View at Publisher · View at Google Scholar · View at Scopus
  20. T. H. Lin, “A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data,” Quality & Quantity, vol. 44, no. 2, pp. 277–287, 2010. View at Publisher · View at Google Scholar · View at Scopus
  21. D. M. Hawkins, L. Liu, and S. S. Young, “Robust singular value decomposition,” National Institute of Statistical Science Technical Report no. 122, 2001. View at Google Scholar
  22. L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  23. R. Tibshirani and T. Hastie, “Outlier sums for differential gene expression analysis,” Biostatistics, vol. 8, no. 1, pp. 2–8, 2007. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  24. C. Cortes and V. Vapnik, “Support-vector networks,” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995. View at Publisher · View at Google Scholar · View at Scopus
  25. A. D. Patterson, O. Maurhofer, D. Beyoğlu et al., “Aberrant lipid metabolism in hepatocellular carcinoma revealed by plasma metabolomics and lipid profiling,” Cancer Research, vol. 71, no. 21, pp. 6590–6600, 2011. View at Publisher · View at Google Scholar · View at Scopus