Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2012 (2012), Article ID 380495, 11 pages
http://dx.doi.org/10.1100/2012/380495
Research Article

Gene Expression Profiles for Predicting Metastasis in Breast Cancer: A Cross-Study Comparison of Classification Methods

1Research Unit of Human Genetics, Institute of Clinical Research, University of Southern Denmark, Sdr. Boulevard 29, 5000 Odense C, Denmark
2Department of Clinical Genetics, Odense University Hospital, Sdr. Boulevard 29, 5000 Odense C, Denmark
3Institute of Public Health, University of Southern Denmark, J. B. Winsløws Vej 9B, 5000 Odense C, Denmark

Received 25 August 2012; Accepted 2 October 2012

Academic Editors: M. A. Kon and K. Najarian

Copyright © 2012 Mark Burton et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. E. Karlsson, U. Delle, A. Danielsson et al., “Gene expression variation to predict 10-year survival in lymph-node-negative breast cancer,” BMC Cancer, vol. 8, article 254, 2008. View at Publisher · View at Google Scholar · View at Scopus
  2. H. Y. Chuang, E. Lee, Y. T. Liu, D. Lee, and T. Ideker, “Network-based classification of breast cancer metastasis,” Molecular Systems Biology, vol. 3, article 140, 2007. View at Publisher · View at Google Scholar · View at Scopus
  3. Q. Tan, M. Thomassen, and T. A. Kruse, “Feature selection for predicting tumor metastases in microarray experiments using paired design,” Cancer Informatics, vol. 3, pp. 133–138, 2007. View at Google Scholar · View at Scopus
  4. M. Thomassen, Q. Tan, F. Eiriksdottir, M. Bak, S. Cold, and T. A. Kruse, “Prediction of metastasis from low-malignant breast cancer by gene expression profiling,” International Journal of Cancer, vol. 120, no. 5, pp. 1070–1075, 2007. View at Publisher · View at Google Scholar · View at Scopus
  5. R. Sabatier, P. Finetti, N. Cervera et al., “A gene expression signature identifies two prognostic subgroups of basal breast cancer,” Breast Cancer Research and Treatment, vol. 126, no. 2, pp. 407–420, 2011. View at Publisher · View at Google Scholar · View at Scopus
  6. L. D. Miller, J. Smeds, J. George et al., “An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 38, pp. 13550–13555, 2005. View at Publisher · View at Google Scholar · View at Scopus
  7. M. H. van Vliet, R. Fabien, H. M. Horlings, M. J. van de Vijver, M. J. T. Reinders, and L. F. A. Wessels, “Pooling breast cancer datasets has a synergetic effect on classification performance and improves signature stability,” BMC Genomics, vol. 9, article 375, 2008. View at Publisher · View at Google Scholar · View at Scopus
  8. M. Thomassen, Q. Tan, F. Eiriksdottir, M. Bak, S. Cold, and T. A. Kruse, “Comparison of gene sets for expression profiling: prediction of metastasis from low-malignant breast cancer,” Clinical Cancer Research, vol. 13, no. 18, part 1, pp. 5355–5360, 2007. View at Publisher · View at Google Scholar · View at Scopus
  9. N. Servant, M. A. Bollet, H. Halfwerk et al., “Search for a gene expression signature of breast cancer local recurrence in young women,” Clinical Cancer Research, vol. 18, no. 6, pp. 1704–1715, 2012. View at Publisher · View at Google Scholar · View at Scopus
  10. T. Zeng and J. Liu, “Mixture classification model based on clinical markers for breast cancer prognosis,” Artificial Intelligence in Medicine, vol. 48, no. 2-3, pp. 129–137, 2010. View at Publisher · View at Google Scholar · View at Scopus
  11. M. Garcia, R. Millat-carus, F. Bertucci, P. Finetti, D. Birnbaum, and G. Bidaut, “Interactome-transcriptome integration for predicting distant metastasis in breast cancer,” Bioinformatics, vol. 28, no. 5, pp. 672–678, 2012. View at Publisher · View at Google Scholar · View at Scopus
  12. R. Díaz-Uriarte and A. S. de Andrés, “Gene selection and classification of microarray data using random forest,” BMC Bioinformatics, vol. 7, article 3, 2006. View at Publisher · View at Google Scholar · View at Scopus
  13. L. J. Lancashire, D. G. Powe, J. S. Reis-Filho et al., “A validated gene expression profile for detecting clinical outcome in breast cancer using artificial neural networks,” Breast Cancer Research and Treatment, vol. 120, no. 1, pp. 83–93, 2010. View at Publisher · View at Google Scholar · View at Scopus
  14. A. Statnikov and C. F. Aliferis, “Are random forests better than support vector machines for microarray-based cancer classification?” AMIA Annual Symposium Proceedings, pp. 686–690, 2007. View at Google Scholar · View at Scopus
  15. M. Pirooznia, J. Y. Yang, M. Q. Qu, and Y. Deng, “A comparative study of different machine learning methods on microarray gene expression data,” BMC Genomics, vol. 9, supplement 1, article S13, 2008. View at Publisher · View at Google Scholar · View at Scopus
  16. M. Zucknick, S. Richardson, and E. A. Stronach, “Comparing the characteristics of gene expression profiles derived by univariate and multivariate classification methods,” Statistical Applications in Genetics and Molecular Biology, vol. 7, no. 1, article 7, 2008. View at Google Scholar · View at Scopus
  17. A. Statnikov, L. Wang, and C. F. Aliferis, “A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification,” BMC Bioinformatics, vol. 9, article 319, 2008. View at Publisher · View at Google Scholar · View at Scopus
  18. J. Önskog, E. Freyhult, M. Landfors, P. Rydén, and T. R. Hvidsten, “Classification of microarrays; synergistic effects between normalization, gene selection and machine learning,” BMC Bioinformatics, vol. 12, article 390, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. S. Y. Kim, “Effects of sample size on robustness and prediction accuracy of a prognostic gene signature,” BMC Bioinformatics, vol. 10, article 147, 2009. View at Publisher · View at Google Scholar · View at Scopus
  20. Y. Hoshida, “Nearest template prediction: a single-sample-based flexible class prediction with confidence assessment,” PLoS One, vol. 5, no. 11, Article ID e15543, 2010. View at Publisher · View at Google Scholar · View at Scopus
  21. M. Chen, L. Shi, R. Kelly, R. Perkins, H. Fang, and W. Tong, “Selecting a single model or combining multiple models for microarray-based classifier development?—a comparative analysis based on large and diverse datasets generated from the MAQC-II project,” BMC Bioinformatics, vol. 12, supplement 10, Article ID S3, 2011. View at Publisher · View at Google Scholar · View at Scopus
  22. M. Zervakis, M. E. Blazadonakis, G. Tsiliki, V. Danilatou, M. Tsiknakis, and D. Kafetzopoulos, “Outcome prediction based on microarray analysis: a critical perspective on methods,” BMC Bioinformatics, vol. 10, article 53, 2009. View at Publisher · View at Google Scholar · View at Scopus
  23. S. Calza, P. Hall, G. Auer et al., “Intrinsic molecular signature of breast cancer in a population-based cohort of 412 patients,” Breast Cancer Research, vol. 8, no. 4, article R34, 2006. View at Publisher · View at Google Scholar · View at Scopus
  24. C. Sotiriou, S. Y. Neo, L. M. McShane et al., “Breast cancer classification and prognosis based on gene expression profiles from a population-based study,” Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 18, pp. 10393–10398, 2003. View at Publisher · View at Google Scholar · View at Scopus
  25. C. Sotiriou, P. Wirapati, S. Loi et al., “Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis,” Journal of the National Cancer Institute, vol. 98, no. 4, pp. 262–272, 2006. View at Publisher · View at Google Scholar · View at Scopus
  26. M. J. van de Vijver, Y. D. He, L. J. van't Veer et al., “A gene-expression signature as a predictor of survival in breast cancer,” New England Journal of Medicine, vol. 347, no. 25, pp. 1999–2009, 2002. View at Publisher · View at Google Scholar · View at Scopus
  27. E. Huang, S. H. Cheng, H. Dressman et al., “Gene expression predictors of breast cancer outcomes,” The Lancet, vol. 361, no. 9369, pp. 1590–1596, 2003. View at Publisher · View at Google Scholar · View at Scopus
  28. Y. Wang, J. G. M. Klijn, Y. Zhang et al., “Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer,” The Lancet, vol. 365, no. 9460, pp. 671–679, 2005. View at Publisher · View at Google Scholar · View at Scopus
  29. M. Buyse, S. Loi, L. van't Veer et al., “Validation and clinical utility of a 70-gene prognostic signature for women with node-negative breast cancer,” Journal of the National Cancer Institute, vol. 98, no. 17, pp. 1183–1192, 2006. View at Publisher · View at Google Scholar · View at Scopus
  30. M. Schmidt, D. Böhm, C. von Törne et al., “The humoral immune system has a key prognostic impact in node-negative breast cancer,” Cancer Research, vol. 68, no. 13, pp. 5405–5413, 2008. View at Publisher · View at Google Scholar · View at Scopus
  31. L. J. van't Veer, H. Dai, M. J. van de Vijver et al., “Gene expression profiling predicts clinical outcome of breast cancer,” Nature, vol. 415, no. 6871, pp. 530–536, 2002. View at Publisher · View at Google Scholar · View at Scopus
  32. C. Desmedt, F. Piette, S. Loi et al., “Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series,” Clinical Cancer Research, vol. 13, no. 11, pp. 3207–3214, 2007. View at Publisher · View at Google Scholar · View at Scopus
  33. L. Nanni, S. Brahnam, and A. Lumini, “Combining multiple approaches for gene microarray classification,” Bioinformatics, vol. 28, no. 8, pp. 1151–1157, 2012. View at Publisher · View at Google Scholar · View at Scopus
  34. M. Thomassen, Q. Tan, and T. A. Kruse, “Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer,” BMC Cancer, vol. 8, article 394, 2008. View at Publisher · View at Google Scholar · View at Scopus
  35. M. Thomassen, Q. Tan, and T. A. Kruse, “Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis,” Breast Cancer Research and Treatment, vol. 113, no. 2, pp. 239–249, 2009. View at Publisher · View at Google Scholar · View at Scopus
  36. L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001. View at Publisher · View at Google Scholar · View at Scopus
  37. T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining Inference and Prediction, Springer, New York, NY, USA, 2nd edition, 2009.
  38. V. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, Ny, USA, 1995.
  39. A. Dupuy and R. M. Simon, “Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting,” Journal of the National Cancer Institute, vol. 99, no. 2, pp. 147–157, 2007. View at Publisher · View at Google Scholar · View at Scopus
  40. R. Hewett and P. Kijsanayothin, “Tumor classification ranking from microarray data,” BMC Genomics, vol. 9, supplement 2, article S21, 2008. View at Publisher · View at Google Scholar · View at Scopus
  41. G. Natsoulis, L. El Ghaoui, G. R. G. Lanckriet et al., “Classification of a large microarray data set: algorithm comparison and analysis of drug signatures,” Genome Research, vol. 15, no. 5, pp. 724–736, 2005. View at Publisher · View at Google Scholar · View at Scopus
  42. Y. Peng, “A novel ensemble machine learning for robust microarray data classification,” Computers in Biology and Medicine, vol. 36, no. 6, pp. 553–573, 2006. View at Publisher · View at Google Scholar · View at Scopus
  43. C. J. Xu, H. C. J. Hoefsloot, and A. K. Smilde, “To aggregate or not to aggregate high-dimensional classifiers,” BMC Bioinformatics, vol. 12, article 153, 2011. View at Publisher · View at Google Scholar · View at Scopus
  44. S. L. Taylor and K. Kim, “A jackknife and voting classifier approach to feature selection and classification,” Cancer Informatics, vol. 10, pp. 133–147, 2011. View at Publisher · View at Google Scholar · View at Scopus
  45. A. Statnikov, C. F. Aliferis, I. Tsamardinos, D. Hardin, and S. Levy, “A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis,” Bioinformatics, vol. 21, no. 5, pp. 631–643, 2005. View at Publisher · View at Google Scholar · View at Scopus
  46. A. C. Tan and D. Gilbert, “Ensemble machine learning on gene expression data for cancer classification,” Appl Bioinformatics, vol. 2, supplement 3, pp. S75–S83, 2003. View at Google Scholar · View at Scopus
  47. M. S. Sewak, N. P. Reddy, and Z. H. Duan, “Gene expression based leukemia sub-classification using committee neural networks,” Bioinformatics and Biology Insights, vol. 3, pp. 89–98, 2009. View at Google Scholar
  48. L. Breiman, “Bagging predictors,” Machine Learning, vol. 24, no. 2, pp. 123–140, 1996. View at Google Scholar · View at Scopus
  49. X. Wang and O. Gotoh, “Accurate molecular classification of cancer using simple rules,” BMC Medical Genomics, vol. 2, article 64, 2009. View at Google Scholar · View at Scopus
  50. H. T. Huynh, J. J. Kim, and Y. Won, “Performance comparison of SLFN training algorithms for DNA microarray classification,” Advances in Experimental Medicine and Biology, vol. 696, pp. 135–143, 2011. View at Publisher · View at Google Scholar · View at Scopus