Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2015, Article ID 562716, 15 pages
http://dx.doi.org/10.1155/2015/562716
Research Article

Predicting Component Failures Using Latent Dirichlet Allocation

1Key Laboratory of Dependable Service Computing in Cyber Physical Society, Ministry of Education, Chongqing 400044, China
2School of Software Engineering, Chongqing University, Chongqing 401331, China

Received 10 January 2015; Revised 28 May 2015; Accepted 15 June 2015

Academic Editor: Mustapha Nourelfath

Copyright © 2015 Hailin Liu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. L. D. Balk and A. Kedia, “PPT: a COTS integration case study,” in Proceedings of the International Conference on Software Engineering, pp. 42–49, June 2000. View at Scopus
  2. H. C. Cunningham, Y. Liu, P. Tadepalli, and M. Fu, “Component software: a new software engineering course,” Journal of Computing Sciences in Colleges, vol. 18, no. 6, pp. 10–21, 2003. View at Google Scholar
  3. D. Gray, D. Bowes, N. Davey, Y. Sun, and B. Christianson, “Using the support vector machine as a classification method for software defect prediction with static code metrics,” in Engineering Applications of Neural Networks, pp. 223–234, Springer, 2009. View at Publisher · View at Google Scholar
  4. M. Fischer, M. Pinzger, and H. Gall, “Populating a release history database from version control and bug tracking systems,” in Proceedings of the International Conference on Software Maintenance (ICSM '03), pp. 23–32, IEEE, September 2003. View at Publisher · View at Google Scholar · View at Scopus
  5. T. Zimmermann, A. Zeller, P. Weissgerber, and S. Diehl, “Mining version histories to guide software changes,” IEEE Transactions on Software Engineering, vol. 31, no. 6, pp. 429–445, 2005. View at Publisher · View at Google Scholar · View at Scopus
  6. K. El Emam, W. Melo, and J. C. Machado, “The prediction of faulty classes using object-oriented design metrics,” Journal of Systems and Software, vol. 56, no. 1, pp. 63–75, 2001. View at Publisher · View at Google Scholar · View at Scopus
  7. T. J. Ostrand, E. J. Weyuker, and R. M. Bell, “Predicting the location and number of faults in large software systems,” IEEE Transactions on Software Engineering, vol. 31, no. 4, pp. 340–355, 2005. View at Publisher · View at Google Scholar · View at Scopus
  8. T. L. Graves, A. F. Karr, U. S. Marron, and H. Siy, “Predicting fault incidence using software change history,” IEEE Transactions on Software Engineering, vol. 26, no. 7, pp. 653–661, 2000. View at Publisher · View at Google Scholar · View at Scopus
  9. N. Nagappan, T. Ball, and A. Zeller, “Mining metrics to predict component failures,” in Proceedings of the 28th International Conference on Software Engineering (ICSE '06), pp. 452–461, May 2006. View at Scopus
  10. N. S. Gill and P. S. Grover, “Component-based measurement,” ACM SIGSOFT Software Engineering Notes, vol. 28, no. 6, 2003. View at Publisher · View at Google Scholar
  11. Y. Liu, D. Poshyvanyk, R. Ferenc, T. Gyimóthy, and N. Chrisochoides, “Modeling class cohesion as mixtures of latent topics,” in Proceedings of the IEEE International Conference on Software Maintenance (ICSM '09), pp. 233–242, September 2009. View at Publisher · View at Google Scholar · View at Scopus
  12. T. T. Nguyen, T. N. Nguyen, and T. M. Phuong, “Topic-based defect prediction (NIER track),” in Proceeding of the 33rd International Conference on Software Engineering (ICSE '11), pp. 932–935, May 2011. View at Publisher · View at Google Scholar · View at Scopus
  13. G. Maskeri, S. Sarkar, and K. Heafield, “Mining business topics in source code using latent dirichlet allocation,” in Proceedings of the 1st India Software Engineering Conference (ISEC '08), pp. 113–120, February 2008. View at Publisher · View at Google Scholar · View at Scopus
  14. D. M. Blei, A. Y. Ng, and M. I. Jordan, “Latent Dirichlet allocation,” Journal of Machine Learning Research, vol. 3, no. 4-5, pp. 993–1022, 2003. View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  15. T.-H. Chen, S. W. Thomas, M. Nagappan, and A. E. Hassan, “Explaining software defects using topic models,” in Proceedings of the 9th IEEE Working Conference on Mining Software Repositories (MSR '12), pp. 189–198, June 2012. View at Publisher · View at Google Scholar · View at Scopus
  16. F. Elberzhager, A. Rosbach, J. Münch, and R. Eschbach, “Reducing test effort: A systematic mapping study on existing approaches,” Information and Software Technology, vol. 54, no. 10, pp. 1092–1106, 2012. View at Publisher · View at Google Scholar · View at Scopus
  17. M. M. T. Thwin and T.-S. Quah, “Application of neural networks for software quality prediction using object-oriented metrics,” Journal of Systems and Software, vol. 76, no. 2, pp. 147–156, 2005. View at Publisher · View at Google Scholar · View at Scopus
  18. T. Gyimóthy, R. Ferenc, and I. Siket, “Empirical validation of object-oriented metrics on open source software for fault prediction,” IEEE Transactions on Software Engineering, vol. 31, no. 10, pp. 897–910, 2005. View at Publisher · View at Google Scholar · View at Scopus
  19. R. Malhotra, “A systematic review of machine learning techniques for software fault prediction,” Applied Soft Computing, vol. 27, pp. 504–518, 2015. View at Publisher · View at Google Scholar · View at Scopus
  20. L. Guo, Y. Ma, B. Cukic, and H. Singh, “Robust prediction of fault-proneness by random forests,” in Proceedings of the 15th International Symposium on Software Reliability Engineering (ISSRE '04), pp. 417–428, IEEE, November 2004. View at Publisher · View at Google Scholar · View at Scopus
  21. Y. Peng, G. Kou, G. Wang, W. Wu, and Y. Shi, “Ensemble of software defect predictors: an AHP-based evaluation method,” International Journal of Information Technology and Decision Making, vol. 10, no. 1, pp. 187–206, 2011. View at Publisher · View at Google Scholar · View at Scopus
  22. B. Turhan and A. Bener, “A multivariate analysis of static code attributes for defect prediction,” in Proceeding of the 33rd 7th International Conference on Quality Software (QSIC '07), pp. 231–237, October 2007. View at Publisher · View at Google Scholar · View at Scopus
  23. K. O. Elish and M. O. Elish, “Predicting defect-prone software modules using support vector machines,” Journal of Systems and Software, vol. 81, no. 5, pp. 649–660, 2008. View at Publisher · View at Google Scholar · View at Scopus
  24. S. Krishnan, C. Strasburg, R. R. Lutz, K. Goseva-Popstojanova, and K. S. Dorman, “Predicting failure-proneness in an evolving software product line,” Information and Software Technology, vol. 55, no. 8, pp. 1479–1495, 2013. View at Publisher · View at Google Scholar · View at Scopus
  25. X.-Y. Jing, S. Ying, Z.-W. Zhang, S.-S. Wu, and J. Liu, “Dictionary learning based software defect prediction,” in Proceedings of the 36th International Conference on Software Engineering (ICSE '14), pp. 414–423, ACM, Hyderabad, India, May-June 2014. View at Publisher · View at Google Scholar
  26. B. Caglayan, A. Tosun Misirli, A. B. Bener, and A. Miranskyy, “Predicting defective modules in different test phases,” Software Quality Journal, vol. 23, no. 2, pp. 205–227, 2014. View at Publisher · View at Google Scholar · View at Scopus
  27. X. Yang, K. Tang, and X. Yao, “A learning-to-rank approach to software defect prediction,” IEEE Transactions on Reliability, vol. 64, no. 1, pp. 234–246, 2015. View at Publisher · View at Google Scholar
  28. N. Ullah, “A method for predicting open source software residual defects,” Software Quality Journal, vol. 23, no. 1, pp. 55–76, 2015. View at Publisher · View at Google Scholar
  29. G. Abaei, A. Selamat, and H. Fujita, “An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction,” Knowledge-Based Systems, vol. 74, pp. 28–39, 2015. View at Publisher · View at Google Scholar
  30. T. M. Khoshgoftaar, E. B. Allen, K. S. Kalaichelvan, and N. Goel, “Early quality prediction: a case study in telecommunications,” IEEE Software, vol. 13, no. 1, pp. 65–71, 1996. View at Publisher · View at Google Scholar · View at Scopus
  31. N. Ohlsson and H. Alberg, “Predicting fault-prone software modules in telephone switches,” IEEE Transactions on Software Engineering, vol. 22, no. 12, pp. 886–894, 1996. View at Publisher · View at Google Scholar · View at Scopus
  32. S. Neuhaus, T. Zimmermann, C. Holler, and A. Zeller, “Predicting vulnerable software components,” in Proceedings of the 14th ACM Conference on Computer and Communications Security (CCS '07), pp. 529–540, ACM, Denver, Colo, USA, November 2007. View at Publisher · View at Google Scholar · View at Scopus
  33. A. Schröter, T. Zimmermann, and A. Zeller, “Predicting component failures at design time,” in Proceedings of the 5th ACM-IEEE International Symposium on Empirical Software Engineering (ISCE '06), pp. 18–27, September 2006. View at Publisher · View at Google Scholar · View at Scopus
  34. S. W. Thomas, B. Adams, A. E. Hassan, and D. Blostein, “Studying software evolution using topic models,” Science of Computer Programming, vol. 80, pp. 457–479, 2014. View at Publisher · View at Google Scholar · View at Scopus
  35. S. W. Thomas, B. Adams, A. E. Hassan, and D. Blostein, “Modeling the evolution of topics in source code histories,” in Proceedings of the 8th Working Conference on Mining Software Repositories (MSR '11), pp. 173–182, May 2011. View at Publisher · View at Google Scholar · View at Scopus
  36. S. K. Lukins, N. A. Kraft, and L. H. Etzkorn, “Bug localization using latent Dirichlet allocation,” Information and Software Technology, vol. 52, no. 9, pp. 972–990, 2010. View at Publisher · View at Google Scholar · View at Scopus
  37. R. Scheaffer, J. McClave, and E. R. Ziegel, “Probability and statistics for engineers,” Technometrics, vol. 37, no. 2, p. 239, 1995. View at Publisher · View at Google Scholar
  38. P. Knab, M. Pinzger, and A. Bernstein, “Predicting defect densities in source code files with decision tree learners,” in Proceedings of the International Workshop on Mining Software Repositories (MSR '06), pp. 119–125, May 2006. View at Publisher · View at Google Scholar · View at Scopus
  39. F. Akiyama, “An example of software system debugging,” in Proceedings of the IFIP Congress, pp. 353–359, Ljubljana, Slovenia, 1971.
  40. A. Hindle, M. W. Godfrey, and R. C. Holt, “What's hot and what's not: Windowed developer topic analysis,” in Proceedings of the IEEE International Conference on Software Maintenance (ICSM '09), pp. 339–348, IEEE, Edmonton, UK, September 2009. View at Publisher · View at Google Scholar · View at Scopus
  41. H. M. Wallach, I. Murray, R. Salakhutdinov, and D. Mimno, “Evaluation methods for topic models,” in Proceedings of the 26th Annual International Conference on Machine Learning (ICML '09), vol. 4, pp. 1105–1112, 2009. View at Publisher · View at Google Scholar
  42. T. L. Griffiths and M. Steyvers, “Finding scientific topics,” Proceedings of the National Academy of Sciences of the United States of America, vol. 101, supplement 1, pp. 5228–5235, 2004. View at Google Scholar
  43. S. Grant and J. R. Cordy, “Estimating the optimal number of latent concepts in source code analysis,” in Proceedings of the 10th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM '10), pp. 65–74, September 2010. View at Publisher · View at Google Scholar · View at Scopus
  44. A. D. Lovie, “Who discovered Spearman's rank correlation?” British Journal of Mathematical and Statistical Psychology, vol. 48, no. 2, pp. 255–269, 1995. View at Publisher · View at Google Scholar
  45. M. D'Ambros, M. Lanza, and R. Robbes, “An extensive comparison of bug prediction approaches,” in Proceedings of the 7th IEEE Working Conference on Mining Software Repositories (MSR '10), pp. 31–41, IEEE, May 2010. View at Publisher · View at Google Scholar · View at Scopus