Table of Contents
Journal of Quality and Reliability Engineering
Volume 2015 (2015), Article ID 795154, 9 pages
http://dx.doi.org/10.1155/2015/795154
Research Article

Variable Selection Methods for Right-Censored Time-to-Event Data with High-Dimensional Covariates

Department of Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USA

Received 1 October 2014; Revised 13 April 2015; Accepted 16 April 2015

Academic Editor: Christian Kirchsteiger

Copyright © 2015 Keivan Sadeghzadeh and Nasser Fard. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. F. Yao, “Functional principal component analysis for longitudinal and survival data,” Statistica Sinica, vol. 17, no. 3, pp. 965–983, 2007. View at Google Scholar · View at MathSciNet · View at Scopus
  2. J. Hellerstein, “Parallel Programming in the Age of Big Data,” Gigaom Blog, 2008.
  3. T. Segaran and J. Hammerbacher, Beautiful Data: The Stories Behind Elegant Data Solutions, O'Reilly Media, 2009.
  4. D. Feldman, M. Schmidt, and C. Sohler, “Turning Big data into tiny data: constant-size coresets for k-means, PCA and projective clustering,” in Proceedings of the 24th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '13), pp. 1434–1453, January 2013. View at Scopus
  5. J. Manyika, M. Chui, B. Brown et al., Big Data: The Next Frontier for Innovation, Competition, and Productivity, McKinsey Global Institute, 2011.
  6. J. Moran, Is Big Data a Big Problem for Manufacturers?Sikich, 2013.
  7. B. Brown, M. Chul, and J. Manyika, “Are you ready for the era of ‘big data’?” McKinsey Quarterly, no. 4, pp. 24–35, 2011. View at Google Scholar · View at Scopus
  8. S. Lohr, “The age of big data,” New York Times, 2012. View at Google Scholar
  9. E. Brynjolfsson, A Revolution in Decision-Making Improves Productivity, MIT Press, 2012.
  10. P. Russom, “Big data analytics,” TDWI Best Practices Report, Fourth Quarter, 2011. View at Google Scholar
  11. K. Sadeghzadeh and M. B. Salehi, “Mathematical analysis of fuel cell strategic technologies development solutions in the automotive industry by the TOPSIS multi-criteria decision making method,” International Journal of Hydrogen Energy, vol. 36, no. 20, pp. 13272–13280, 2011. View at Publisher · View at Google Scholar · View at Scopus
  12. J. Chai, J. N. K. Liu, and E. W. T. Ngai, “Application of decision-making techniques in supplier selection: a systematic review of literature,” Expert Systems with Applications, vol. 40, no. 10, pp. 3872–3885, 2013. View at Publisher · View at Google Scholar · View at Scopus
  13. A. R. Tenner and I. J. Detoro, Total Quality Management: Three Steps to Continuous Improvement, Addison-Wesley, 1994.
  14. D. A. Garvin, “Competing on the eight dimensions of quality,” IEEE Engineering Management Review, vol. 24, no. 1, pp. 15–23, 1996. View at Google Scholar · View at Scopus
  15. D. H. Besterfield, C. Besterfield-Michna, G. H. Besterfield, and M. Besterfield-Sacre, Total Quality Management, Prentice Hall, Englewood Cliffs, NJ, USA, 1995.
  16. E. Auschitzky, M. Hammer, and A. Rajagopaul, How Big Data Can Improve Manufacturing, McKinsey & Company, 2014.
  17. M. Nemschoff, How Big Data Can Improve Manufacturing Quality, 2014.
  18. H. Akaike, “An information criterion (AIC),” Mathematical Sciences, 1976. View at Google Scholar
  19. G. Schwarz, “Estimating the dimension of a model,” The Annals of Statistics, vol. 6, no. 2, pp. 461–464, 1978. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  20. D. R. Cox, “Regression models and life-tables,” Journal of the Royal Statistical Society. Series B. Methodological, vol. 34, pp. 187–220, 1972. View at Google Scholar · View at MathSciNet
  21. J. D. Kalbfleisch and R. L. Prentice, The Statistical Analysis of Failure Time Data, John Wiley & Sons, 2011.
  22. J. Buckley and I. James, “Linear regression with censored data,” Biometrika, vol. 66, no. 3, pp. 429–436, 1979. View at Publisher · View at Google Scholar · View at Scopus
  23. H. Ishwaran, U. B. Kogalur, E. H. Blackstone, and M. S. Lauer, “Random survival forests,” The Annals of Applied Statistics, vol. 2, no. 3, pp. 841–860, 2008. View at Publisher · View at Google Scholar · View at Scopus
  24. S. Ma, M. R. Kosorok, and J. P. Fine, “Additive risk models for survival data with high-dimensional covariates,” Biometrics, vol. 62, no. 1, pp. 202–210, 2006. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  25. J. Huang, S. Ma, and H. Xie, “Regularized estimation in the accelerated failure time model with high-dimensional covariates,” Biometrics. Journal of the International Biometric Society, vol. 62, no. 3, pp. 813–820, 2006. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  26. L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, Classification and Regression Trees, Wadsworth & Brooks, 1984.
  27. E. T. Lee and J. W. Wang, Statistical Methods for Survival Data Analysis, John Wiley & Sons, 2003.
  28. Jr. Hosmer D. W. and S. Lemeshow, Applied Survival Analysis: Regression Modeling of Time to Event Data, John Wiley & Sons, New York, NY, USA, 1999. View at MathSciNet
  29. L.-P. Kronek and A. Reddy, “Logical analysis of survival data: prognostic survival models by detecting high-degree interactions in right-censored data,” Bioinformatics, vol. 24, no. 16, pp. i248–i253, 2008. View at Publisher · View at Google Scholar · View at Scopus
  30. T. R. Holford, Multivariate Methods in Epidemiology, Oxford University Press, Oxford, UK, 2002.
  31. A. V. Peterson Jr., “Expressing the Kaplan-Meier estimator as a function of empirical subsurvival functions,” Journal of the American Statistical Association, vol. 72, no. 360, p. 854, 1977. View at Publisher · View at Google Scholar
  32. J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, Morgan Kaufmann Publishers, 2006.
  33. P. Tan, M. Steinbach, and V. Kumar, Introduction to Data Mining, Addison Wesley, Pearson, 2006.
  34. I. Jolliffe, Principal Component Analysis, John Wiley & Sons, New York, NY, USA, 2005.
  35. H. Abdi and L. J. Williams, “Principal component analysis,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 4, pp. 433–459, 2010. View at Publisher · View at Google Scholar
  36. H. Motoda and L. Huan, Feature Selection, Extraction and Construction, Communication of IICM, 2002.
  37. D. Addison, S. Wermter, and G. Z. Arevian, “A comparison of feature extraction and selection techniques,” in Proceedings of the International Conference on Artificial Neural Networks, Istanbul, Turkey, 2003.
  38. J. Ye, Unsupervised and Supervised Dimension Reduction: Algorithms and Connections, Arizona State University, Tempe, Ariz, USA, 2005.
  39. Y. Saeys, I. Inza, and P. Larrañaga, “A review of feature selection techniques in bioinformatics,” Bioinformatics, vol. 23, no. 19, pp. 2507–2517, 2007. View at Publisher · View at Google Scholar · View at Scopus
  40. I. Iguyon and A. Elisseeff, “An introduction to variable and feature selection,” Journal of Machine Learning Research, vol. 3, pp. 1157–1182, 2003. View at Google Scholar · View at Scopus
  41. K. Sadeghzadeh and N. Fard, “Nonparametric data reduction approach for large-scale survival data analysis,” IEEE. In press.
  42. H. Liu, F. Hussain, C. L. Tan, and M. Dash, “Discretization: an enabling technique,” Data Mining and Knowledge Discovery, vol. 6, no. 4, pp. 393–423, 2002. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  43. K. Sadeghzadeh and N. Fard, “Multidisciplinary decision-making approach to high-dimensional event history analysis through variable reduction methods,” European Journal of Economics and Management, vol. 1, no. 2, 2014. View at Google Scholar
  44. T. A. Feo and M. G. C. Resende, “Greedy randomized adaptive search procedures,” Journal of Global Optimization, vol. 6, no. 2, pp. 109–133, 1995. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  45. J. P. Hart and A. W. Shogan, “Semi-greedy heuristics: an empirical study,” Operations Research Letters, vol. 6, no. 3, pp. 107–114, 1987. View at Publisher · View at Google Scholar · View at Scopus
  46. M. Gendreau and J. Y. Potvin, “Tabu Search,” in Search Methodologies, Springer, Berlin, Germany, 2005. View at Google Scholar
  47. W. R. Zwick and W. F. Velicer, “Comparison of five rules for determining the number of components to retain,” Psychological Bulletin, vol. 99, no. 3, pp. 432–442, 1986. View at Publisher · View at Google Scholar · View at Scopus
  48. I. Ishwaran and U. B. Kogalur, “Random survival forests for R,” Rnews, 2007. View at Google Scholar