Table of Contents Author Guidelines Submit a Manuscript
The Scientific World Journal
Volume 2013, Article ID 720392, 10 pages
http://dx.doi.org/10.1155/2013/720392
Research Article

Clustering-Based Multiple Imputation via Gray Relational Analysis for Missing Data and Its Application to Aerospace Field

State Key Laboratory of Software Development Environment, Beihang University, No. 37 Xueyuan Road, Haidian District, Beijing 100191, China

Received 7 March 2013; Accepted 9 April 2013

Academic Editors: Y.-P. Huang, P. Melin, M. F. G. Penedo, and D. Rodriguez

Copyright © 2013 Jing Tian et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. J. C. Bezdek, J. Keller, R. Krishnapuram, and N. R. Pal, Fuzzy Models and Algorithms for Pattern Recognition and Image Processing, The Handbooks of Fuzzy Sets, Springer, 1999.
  2. A. G. Di Nuovo, “Missing data analysis with fuzzy C-Means: a study of its application in a psychological scenario,” Expert Systems with Applications, vol. 38, pp. 6793–6797, 2011. View at Google Scholar
  3. E. T. Jaynes, “Information theory and statistical mechanics,” Physical Review, vol. 106, no. 4, pp. 620–630, 1957. View at Publisher · View at Google Scholar · View at Scopus
  4. A. N. Baraldi and C. K. Enders, “An introduction to modern missing data analyses,” Journal of School Psychology, vol. 48, no. 1, pp. 5–37, 2010. View at Publisher · View at Google Scholar · View at Scopus
  5. S. Zhang, Z. Jin, X. Zhu, and J. Zhang, “Missing data analysis: a kernel-based multi-imputation approach,” Transactions on Computational Science, vol. 5300, pp. 122–142, 2009. View at Publisher · View at Google Scholar · View at Scopus
  6. A. Farhangfar, L. Kurgan, and W. Pedrycz, “Experimental analysis of methods for imputation of missing values in databases,” in Proceedings of the Intelligent Computing: Theory and Applications II, vol. 5421 of Proceedings of SPIE, pp. 172–182, April 2004. View at Publisher · View at Google Scholar · View at Scopus
  7. P. J. García-Laencina, J. L. Sancho-Gómez, A. R. Figueiras-Vidal, and M. Verleysen, “K nearest neighbours with mutual information for simultaneous classification and missing data imputation,” Neurocomputing, vol. 72, no. 7–9, pp. 1483–1493, 2009. View at Publisher · View at Google Scholar · View at Scopus
  8. J. R. Quinlan, “Induction of decision trees,” Machine Learning, vol. 1, no. 1, pp. 81–106, 1986. View at Publisher · View at Google Scholar · View at Scopus
  9. M. Di Zio, U. Guarnera, and O. Luzi, “Imputation through finite Gaussian mixture models,” Computational Statistics and Data Analysis, vol. 51, no. 11, pp. 5305–5316, 2007. View at Publisher · View at Google Scholar · View at Scopus
  10. Y. Qin, S. Zhang, X. Zhu, J. Zhang, and C. Zhang, “POP algorithm: kernel-based imputation to treat missing values in knowledge discovery from databases,” Expert Systems with Applications, vol. 36, no. 2, pp. 2794–2804, 2009. View at Publisher · View at Google Scholar · View at Scopus
  11. H. Junninen, H. Niska, K. Tuppurainen, J. Ruuskanen, and M. Kolehmainen, “Methods for imputation of missing values in air quality data sets,” Atmospheric Environment, vol. 38, no. 18, pp. 2895–2907, 2004. View at Publisher · View at Google Scholar · View at Scopus
  12. J. L. Schafer, Analysis of Incomplete Multivariate Data, Chapman & Hall/CRC Press, London, UK, 1997.
  13. P. D. Allison, Missing Data, Sage University Papers, Quantitative Applications in the Social Sciences, Thousand Oaks, Calif, USA, 2001.
  14. R. J. A. Little and D. B. Rubin, Statistical Analysis with Missing Data, John Wiley & Sons, Hoboken, NJ, USA, 2nd edition, 2002.
  15. C. Enders, S. Dietz, M. Montague, and J. Dixon, “Modern alternatives for dealing with missing data in special education research,” in Advances in Learning and Behavioral Disorders, T. E. Scruggs and M. A. Mastropieri, Eds., vol. 19,, pp. 101–130, Elsevier, New York, NY, USA, 2006. View at Google Scholar
  16. A. R. T. Donders, G. J. M. G. van der Heijden, T. Stijnen, and K. G. M. Moons, “Review: a gentle introduction to imputation of missing values,” Journal of Clinical Epidemiology, vol. 59, no. 10, pp. 1087–1091, 2006. View at Publisher · View at Google Scholar · View at Scopus
  17. J. F. Muñoz and M. Rueda, “New imputation methods for missing data using quantiles,” Journal of Computational and Applied Mathematics, vol. 232, no. 2, pp. 305–317, 2009. View at Publisher · View at Google Scholar · View at Scopus
  18. S. Zhang, Z. Jin, and X. Zhu, “Missing data imputation by utilizing information within incomplete instances,” Journal of Systems and Software, vol. 84, no. 3, pp. 452–459, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. B. Zhu, C. He, and P. Liatsis, “A robust missing value imputation method for noisy data,” Applied Intelligence, vol. 36, no. 1, pp. 61–74, 2012. View at Publisher · View at Google Scholar · View at Scopus
  20. S. M. Chen and H. H. Chen, “Estimating null values in the distributed relational databases environment,” Cybernetics and Systems, vol. 31, no. 8, pp. 851–871, 2000. View at Google Scholar · View at Scopus
  21. G. J. McLachlan, K. A. Do, and C. Ambroise, Analyzing Microarray Gene Expression Data, John Wiley & Sons, 2004.
  22. C. C. Huang and H. M. Lee, “A grey-based nearest neighbor approach for missing attribute value prediction,” Applied Intelligence, vol. 20, no. 3, pp. 239–252, 2004. View at Publisher · View at Google Scholar · View at Scopus
  23. E. R. Hruschka Jr., E. R. Hruschka, and N. F. F. Ebecken, “A Bayesian imputation method for a clustering genetic algorithm,” Journal of Computational Methods in Sciences and Engineering, vol. 11, pp. 173–183, 2011. View at Google Scholar
  24. S. M. Chen and C. M. Huang, “Generating weighted fuzzy rules from relational database systems for estimating null values using genetic algorithms,” IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 495–506, 2003. View at Publisher · View at Google Scholar · View at Scopus
  25. D. Li, J. Deogun, W. Spaulding, and B. Shuart, “Towards missing data imputation: a study of fuzzy K-means clustering method,” in Proceedings of the 4th International Conference (RSCTC '04), pp. 573–579, June 2004. View at Scopus
  26. P. Meesad and K. Hengpraprohm, “Combination of KNN-based feature selection and KNN-based missing-value imputation of microarray data,” in Proceedings of the 3rd International Conference on Innovative Computing Information and Control (ICICIC '08), pp. 341–344, June 2008. View at Publisher · View at Google Scholar · View at Scopus
  27. R. J. Hathaway and J. C. Bezdek, “Fuzzy c-means clustering of incomplete data,” IEEE Transactions on Systems, Man, and Cybernetics B, vol. 31, no. 5, pp. 735–744, 2001. View at Publisher · View at Google Scholar · View at Scopus
  28. http://archive.ics.uci.edu/ml/.
  29. C. Zhang, Y. Qin, X. Zhu, J. Zhang, and S. Zhang, “Clustering-based missing value imputation for data preprocessing,” in Proceedings of the IEEE International Conference on Industrial Informatics (INDIN '06), pp. 1081–1086, August 2006. View at Publisher · View at Google Scholar · View at Scopus
  30. B. Twala, “An empirical comparison of techniques for handling incomplete data using decision trees,” Applied Artificial Intelligence, vol. 23, no. 5, pp. 373–405, 2009. View at Publisher · View at Google Scholar · View at Scopus