The Scientific World Journal
Volume 2012 (2012), Article ID 380495, 11 pages
http://dx.doi.org/10.1100/2012/380495
Research Article
Gene Expression Profiles for Predicting Metastasis in Breast Cancer: A Cross-Study Comparison of Classification Methods
1Research Unit of Human Genetics, Institute of Clinical Research, University of Southern Denmark, Sdr. Boulevard 29, 5000 Odense C, Denmark
2Department of Clinical Genetics, Odense University Hospital, Sdr. Boulevard 29, 5000 Odense C, Denmark
3Institute of Public Health, University of Southern Denmark, J. B. Winsløws Vej 9B, 5000 Odense C, Denmark
Received 25 August 2012; Accepted 2 October 2012
Academic Editors: M. A. Kon and K. Najarian
Copyright © 2012 Mark Burton et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Linked References
- E. Karlsson, U. Delle, A. Danielsson et al., “Gene expression variation to predict 10-year survival in lymph-node-negative breast cancer,” BMC Cancer, vol. 8, article 254, 2008. View at Publisher · View at Google Scholar · View at Scopus
- H. Y. Chuang, E. Lee, Y. T. Liu, D. Lee, and T. Ideker, “Network-based classification of breast cancer metastasis,” Molecular Systems Biology, vol. 3, article 140, 2007. View at Publisher · View at Google Scholar · View at Scopus
- Q. Tan, M. Thomassen, and T. A. Kruse, “Feature selection for predicting tumor metastases in microarray experiments using paired design,” Cancer Informatics, vol. 3, pp. 133–138, 2007. View at Google Scholar · View at Scopus
- M. Thomassen, Q. Tan, F. Eiriksdottir, M. Bak, S. Cold, and T. A. Kruse, “Prediction of metastasis from low-malignant breast cancer by gene expression profiling,” International Journal of Cancer, vol. 120, no. 5, pp. 1070–1075, 2007. View at Publisher · View at Google Scholar · View at Scopus
- R. Sabatier, P. Finetti, N. Cervera et al., “A gene expression signature identifies two prognostic subgroups of basal breast cancer,” Breast Cancer Research and Treatment, vol. 126, no. 2, pp. 407–420, 2011. View at Publisher · View at Google Scholar · View at Scopus
- L. D. Miller, J. Smeds, J. George et al., “An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival,” Proceedings of the National Academy of Sciences of the United States of America, vol. 102, no. 38, pp. 13550–13555, 2005. View at Publisher · View at Google Scholar · View at Scopus
- M. H. van Vliet, R. Fabien, H. M. Horlings, M. J. van de Vijver, M. J. T. Reinders, and L. F. A. Wessels, “Pooling breast cancer datasets has a synergetic effect on classification performance and improves signature stability,” BMC Genomics, vol. 9, article 375, 2008. View at Publisher · View at Google Scholar · View at Scopus
- M. Thomassen, Q. Tan, F. Eiriksdottir, M. Bak, S. Cold, and T. A. Kruse, “Comparison of gene sets for expression profiling: prediction of metastasis from low-malignant breast cancer,” Clinical Cancer Research, vol. 13, no. 18, part 1, pp. 5355–5360, 2007. View at Publisher · View at Google Scholar · View at Scopus
- N. Servant, M. A. Bollet, H. Halfwerk et al., “Search for a gene expression signature of breast cancer local recurrence in young women,” Clinical Cancer Research, vol. 18, no. 6, pp. 1704–1715, 2012. View at Publisher · View at Google Scholar · View at Scopus
- T. Zeng and J. Liu, “Mixture classification model based on clinical markers for breast cancer prognosis,” Artificial Intelligence in Medicine, vol. 48, no. 2-3, pp. 129–137, 2010. View at Publisher · View at Google Scholar · View at Scopus
- M. Garcia, R. Millat-carus, F. Bertucci, P. Finetti, D. Birnbaum, and G. Bidaut, “Interactome-transcriptome integration for predicting distant metastasis in breast cancer,” Bioinformatics, vol. 28, no. 5, pp. 672–678, 2012. View at Publisher · View at Google Scholar · View at Scopus
- R. Díaz-Uriarte and A. S. de Andrés, “Gene selection and classification of microarray data using random forest,” BMC Bioinformatics, vol. 7, article 3, 2006. View at Publisher · View at Google Scholar · View at Scopus
- L. J. Lancashire, D. G. Powe, J. S. Reis-Filho et al., “A validated gene expression profile for detecting clinical outcome in breast cancer using artificial neural networks,” Breast Cancer Research and Treatment, vol. 120, no. 1, pp. 83–93, 2010. View at Publisher · View at Google Scholar · View at Scopus
- A. Statnikov and C. F. Aliferis, “Are random forests better than support vector machines for microarray-based cancer classification?” AMIA Annual Symposium Proceedings, pp. 686–690, 2007. View at Google Scholar · View at Scopus
- M. Pirooznia, J. Y. Yang, M. Q. Qu, and Y. Deng, “A comparative study of different machine learning methods on microarray gene expression data,” BMC Genomics, vol. 9, supplement 1, article S13, 2008. View at Publisher · View at Google Scholar · View at Scopus
- M. Zucknick, S. Richardson, and E. A. Stronach, “Comparing the characteristics of gene expression profiles derived by univariate and multivariate classification methods,” Statistical Applications in Genetics and Molecular Biology, vol. 7, no. 1, article 7, 2008. View at Google Scholar · View at Scopus
- A. Statnikov, L. Wang, and C. F. Aliferis, “A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification,” BMC Bioinformatics, vol. 9, article 319, 2008. View at Publisher · View at Google Scholar · View at Scopus
- J. Önskog, E. Freyhult, M. Landfors, P. Rydén, and T. R. Hvidsten, “Classification of microarrays; synergistic effects between normalization, gene selection and machine learning,” BMC Bioinformatics, vol. 12, article 390, 2011. View at Publisher · View at Google Scholar · View at Scopus
- S. Y. Kim, “Effects of sample size on robustness and prediction accuracy of a prognostic gene signature,” BMC Bioinformatics, vol. 10, article 147, 2009. View at Publisher · View at Google Scholar · View at Scopus
- Y. Hoshida, “Nearest template prediction: a single-sample-based flexible class prediction with confidence assessment,” PLoS One, vol. 5, no. 11, Article ID e15543, 2010. View at Publisher · View at Google Scholar · View at Scopus
- M. Chen, L. Shi, R. Kelly, R. Perkins, H. Fang, and W. Tong, “Selecting a single model or combining multiple models for microarray-based classifier development?—a comparative analysis based on large and diverse datasets generated from the MAQC-II project,” BMC Bioinformatics, vol. 12, supplement 10, Article ID S3, 2011. View at Publisher · View at Google Scholar · View at Scopus
- M. Zervakis, M. E. Blazadonakis, G. Tsiliki, V. Danilatou, M. Tsiknakis, and D. Kafetzopoulos, “Outcome prediction based on microarray analysis: a critical perspective on methods,” BMC Bioinformatics, vol. 10, article 53, 2009. View at Publisher · View at Google Scholar · View at Scopus
- S. Calza, P. Hall, G. Auer et al., “Intrinsic molecular signature of breast cancer in a population-based cohort of 412 patients,” Breast Cancer Research, vol. 8, no. 4, article R34, 2006. View at Publisher · View at Google Scholar · View at Scopus
- C. Sotiriou, S. Y. Neo, L. M. McShane et al., “Breast cancer classification and prognosis based on gene expression profiles from a population-based study,” Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 18, pp. 10393–10398, 2003. View at Publisher · View at Google Scholar · View at Scopus
- C. Sotiriou, P. Wirapati, S. Loi et al., “Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis,” Journal of the National Cancer Institute, vol. 98, no. 4, pp. 262–272, 2006. View at Publisher · View at Google Scholar · View at Scopus
- M. J. van de Vijver, Y. D. He, L. J. van't Veer et al., “A gene-expression signature as a predictor of survival in breast cancer,” New England Journal of Medicine, vol. 347, no. 25, pp. 1999–2009, 2002. View at Publisher · View at Google Scholar · View at Scopus
- E. Huang, S. H. Cheng, H. Dressman et al., “Gene expression predictors of breast cancer outcomes,” The Lancet, vol. 361, no. 9369, pp. 1590–1596, 2003. View at Publisher · View at Google Scholar · View at Scopus
- Y. Wang, J. G. M. Klijn, Y. Zhang et al., “Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer,” The Lancet, vol. 365, no. 9460, pp. 671–679, 2005. View at Publisher · View at Google Scholar · View at Scopus
- M. Buyse, S. Loi, L. van't Veer et al., “Validation and clinical utility of a 70-gene prognostic signature for women with node-negative breast cancer,” Journal of the National Cancer Institute, vol. 98, no. 17, pp. 1183–1192, 2006. View at Publisher · View at Google Scholar · View at Scopus
- M. Schmidt, D. Böhm, C. von Törne et al., “The humoral immune system has a key prognostic impact in node-negative breast cancer,” Cancer Research, vol. 68, no. 13, pp. 5405–5413, 2008. View at Publisher · View at Google Scholar · View at Scopus
- L. J. van't Veer, H. Dai, M. J. van de Vijver et al., “Gene expression profiling predicts clinical outcome of breast cancer,” Nature, vol. 415, no. 6871, pp. 530–536, 2002. View at Publisher · View at Google Scholar · View at Scopus
- C. Desmedt, F. Piette, S. Loi et al., “Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series,” Clinical Cancer Research, vol. 13, no. 11, pp. 3207–3214, 2007. View at Publisher · View at Google Scholar · View at Scopus
- L. Nanni, S. Brahnam, and A. Lumini, “Combining multiple approaches for gene microarray classification,” Bioinformatics, vol. 28, no. 8, pp. 1151–1157, 2012. View at Publisher · View at Google Scholar · View at Scopus
- M. Thomassen, Q. Tan, and T. A. Kruse, “Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer,” BMC Cancer, vol. 8, article 394, 2008. View at Publisher · View at Google Scholar · View at Scopus
- M. Thomassen, Q. Tan, and T. A. Kruse, “Gene expression meta-analysis identifies chromosomal regions and candidate genes involved in breast cancer metastasis,” Breast Cancer Research and Treatment, vol. 113, no. 2, pp. 239–249, 2009. View at Publisher · View at Google Scholar · View at Scopus
- L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001. View at Publisher · View at Google Scholar · View at Scopus
- T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining Inference and Prediction, Springer, New York, NY, USA, 2nd edition, 2009.
- V. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, Ny, USA, 1995.
- A. Dupuy and R. M. Simon, “Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting,” Journal of the National Cancer Institute, vol. 99, no. 2, pp. 147–157, 2007. View at Publisher · View at Google Scholar · View at Scopus
- R. Hewett and P. Kijsanayothin, “Tumor classification ranking from microarray data,” BMC Genomics, vol. 9, supplement 2, article S21, 2008. View at Publisher · View at Google Scholar · View at Scopus
- G. Natsoulis, L. El Ghaoui, G. R. G. Lanckriet et al., “Classification of a large microarray data set: algorithm comparison and analysis of drug signatures,” Genome Research, vol. 15, no. 5, pp. 724–736, 2005. View at Publisher · View at Google Scholar · View at Scopus
- Y. Peng, “A novel ensemble machine learning for robust microarray data classification,” Computers in Biology and Medicine, vol. 36, no. 6, pp. 553–573, 2006. View at Publisher · View at Google Scholar · View at Scopus
- C. J. Xu, H. C. J. Hoefsloot, and A. K. Smilde, “To aggregate or not to aggregate high-dimensional classifiers,” BMC Bioinformatics, vol. 12, article 153, 2011. View at Publisher · View at Google Scholar · View at Scopus
- S. L. Taylor and K. Kim, “A jackknife and voting classifier approach to feature selection and classification,” Cancer Informatics, vol. 10, pp. 133–147, 2011. View at Publisher · View at Google Scholar · View at Scopus
- A. Statnikov, C. F. Aliferis, I. Tsamardinos, D. Hardin, and S. Levy, “A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis,” Bioinformatics, vol. 21, no. 5, pp. 631–643, 2005. View at Publisher · View at Google Scholar · View at Scopus
- A. C. Tan and D. Gilbert, “Ensemble machine learning on gene expression data for cancer classification,” Appl Bioinformatics, vol. 2, supplement 3, pp. S75–S83, 2003. View at Google Scholar · View at Scopus
- M. S. Sewak, N. P. Reddy, and Z. H. Duan, “Gene expression based leukemia sub-classification using committee neural networks,” Bioinformatics and Biology Insights, vol. 3, pp. 89–98, 2009. View at Google Scholar
- L. Breiman, “Bagging predictors,” Machine Learning, vol. 24, no. 2, pp. 123–140, 1996. View at Google Scholar · View at Scopus
- X. Wang and O. Gotoh, “Accurate molecular classification of cancer using simple rules,” BMC Medical Genomics, vol. 2, article 64, 2009. View at Google Scholar · View at Scopus
- H. T. Huynh, J. J. Kim, and Y. Won, “Performance comparison of SLFN training algorithms for DNA microarray classification,” Advances in Experimental Medicine and Biology, vol. 696, pp. 135–143, 2011. View at Publisher · View at Google Scholar · View at Scopus