Table of Contents Author Guidelines Submit a Manuscript
Journal of Probability and Statistics
Volume 2009, Article ID 537139, 18 pages
http://dx.doi.org/10.1155/2009/537139
Review Article

The Rise of Markov Chain Monte Carlo Estimation for Psychometric Modeling

Division of Advanced Studies in Learning, Technology and Psychology in Education, Arizona State University, PO Box 870611, Tempe, AZ 85287-0611, USA

Received 7 August 2009; Accepted 26 November 2009

Academic Editor: Peter van der Heijden

Copyright © 2009 Roy Levy. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. K. A. Bollen, Structural Equations with Latent Variables, Wiley Series in Probability and Mathematical Statistics, John Wiley & Sons, New York, NY, USA, 1989. View at MathSciNet
  2. R. L. Gorsuch, Factor Analysis, Lawrence Earlbaum Associates, Hillsdale, NJ, USA, 2nd edition, 1983.
  3. F. M. Lord, Applications of Item Response Theory to Practical Testing Problems, Lawrence Earlbaum Associates, Hillsdale, NJ, USA, 1980.
  4. P. F. Lazarsfeld and N. W. Henry, Latent Structure Analysis, Houghton Mifflin, Boston, Mass, USA, 1968.
  5. A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin, Bayesian Data Analysis, Texts in Statistical Science Series, Chapman & Hall, London, UK, 1995. View at MathSciNet
  6. X.-Y. Song and S.-Y. Lee, “Full maximum likelihood estimation of polychoric and polyserial correlations with missing data,” Multivariate Behavioral Research, vol. 38, no. 1, pp. 57–79, 2003. View at Google Scholar · View at Scopus
  7. D. V. Lindley and A. F. M. Smith, “Bayes estimates for the linear model,” Journal of the Royal Statistical Society. Series B, vol. 34, pp. 1–41, 1972. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  8. R. J. Mislevy, “Bayes modal estimation in item response models,” Psychometrika, vol. 51, no. 2, pp. 177–195, 1986. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  9. D. V. Lindley and M. R. Novick, “The role of exchangeability in inference,” The Annals of Statistics, vol. 9, no. 1, pp. 45–58, 1981. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  10. P. De Boeck and M. Wilson, Eds., Explanatory Item Response Models: A Generalized Linear and Nonlinear Approach, Statistics for Social Science and Public Policy, Springer, New York, NY, USA, 2004. View at MathSciNet
  11. S.-Y. Lee, Structural Equation Modeling: A Bayesian Approach, Wiley Series in Probability and Statistics, John Wiley & Sons, Chichester, UK, 2007. View at MathSciNet
  12. S. P. Brooks, “Markov chain Monte Carlo method and its application,” The Statistician, vol. 47, no. 1, pp. 69–100, 1998. View at Google Scholar · View at Scopus
  13. A. E. Gelfand and A. F. M. Smith, “Sampling-based approaches to calculating marginal densities,” Journal of the American Statistical Association, vol. 85, no. 410, pp. 398–409, 1990. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  14. W. R. Gilks, S. Richardson, and D. J. Spiegelhalter, Eds., Markov Chain Monte Carlo in Practice, Interdisciplinary Statistics, Chapman & Hall, London, UK, 1996. View at MathSciNet
  15. A. F. M. Smith and G. O. Roberts, “Bayesian computation via the Gibbs sampler and related Markov chain Monte Carlo methods,” Journal of the Royal Statistical Society. Series B, vol. 55, no. 1, pp. 3–23, 1993. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  16. D. J. Spiegelhalter, A. Thomas, N. G. Best, and D. Lunn, WinBUGS User Manual: Version 1.4.3, MRC Biostatistics Unit, Cambridge, UK, 2007.
  17. L. Tierney, “Markov chains for exploring posterior distributions,” The Annals of Statistics, vol. 22, no. 4, pp. 1701–1762, 1994. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  18. G. O. Roberts, “Markov chain concepts related to sampling algorithms,” in Markov Chain Monte Carlo in Practice, W. R. Gilks, S. Richardson, and D. J. Spiegelhalter, Eds., Interdiscip. Statist., pp. 45–57, Chapman & Hall, London, UK, 1996. View at Google Scholar · View at MathSciNet
  19. J. Besag, “Spatial interaction and the statistical analysis of lattice systems,” Journal of the Royal Statistical Society. Series B, vol. 36, pp. 192–236, 1974. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  20. S. Geman and D. Geman, “Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 6, no. 6, pp. 721–741, 1984. View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  21. G. Casella and E. I. George, “Explaining the Gibbs sampler,” The American Statistician, vol. 46, no. 3, pp. 167–174, 1992. View at Google Scholar · View at MathSciNet
  22. W. K. Hastings, “Monte Carlo sampling methods using Markov chains and their applications,” Biometrika, vol. 57, no. 1, pp. 97–109, 1970. View at Google Scholar
  23. S. Chib and E. Greenberg, “Understanding the Metropolis-Hastings algorithm,” The American Statistician, vol. 49, no. 4, pp. 327–335, 1995. View at Google Scholar
  24. N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller, “Equation of state calculations by fast computing machines,” The Journal of Chemical Physics, vol. 21, no. 6, pp. 1087–1092, 1953. View at Google Scholar · View at Scopus
  25. R. Scheines, H. Hoijtink, and A. Boomsma, “Bayesian estimation and testing of structural equation models,” Psychometrika, vol. 64, no. 1, pp. 37–52, 1999. View at Google Scholar · View at Scopus
  26. A. Ansari and K. Jedidi, “Bayesian factor analysis for multilevel binary observations,” Psychometrika, vol. 65, no. 4, pp. 475–496, 2000. View at Publisher · View at Google Scholar · View at MathSciNet
  27. S.-Y. Lee and X.-Y. Song, “Evaluation of the Bayesian and maximum likelihood approaches in analyzing structural equation models with small sample sizes,” Multivariate Behavioral Research, vol. 39, no. 4, pp. 653–686, 2004. View at Publisher · View at Google Scholar · View at Scopus
  28. A. Ansari, K. Jedidi, and L. Dube, “Heterogeneous factor analysis models: a Bayesian approach,” Psychometrika, vol. 67, no. 1, pp. 49–78, 2002. View at Publisher · View at Google Scholar · View at MathSciNet
  29. Z. Zhang, F. Hamagami, L. Wang, J. R. Nesselroade, and K. J. Grimm, “Bayesian analysis of longitudinal data using growth curve models,” International Journal of Behavioral Development, vol. 31, no. 4, pp. 374–383, 2007. View at Publisher · View at Google Scholar · View at Scopus
  30. S.-Y. Lee and X.-Y. Song, “Bayesian model selection for mixtures of structural equation models with an unknown number of components,” The British Journal of Mathematical and Statistical Psychology, vol. 56, no. 1, pp. 145–165, 2003. View at Publisher · View at Google Scholar · View at PubMed · View at MathSciNet
  31. H.-T. Zhu and S.-Y. Lee, “A Bayesian analysis of finite mixtures in the LISREL model,” Psychometrika, vol. 66, no. 1, pp. 133–152, 2001. View at Publisher · View at Google Scholar · View at MathSciNet
  32. S.-Y. Lee, X.-Y. Song, and N.-S. Tang, “Bayesian methods for analyzing structural equation models with covariates, interaction, and quadratic latent variables,” Structural Equation Modeling, vol. 14, no. 3, pp. 404–434, 2007. View at Google Scholar · View at MathSciNet
  33. S.-Y. Lee and N.-S. Tang, “Bayesian analysis of nonlinear structural equation models with nonignorable missing data,” Psychometrika, vol. 71, no. 3, pp. 541–564, 2006. View at Publisher · View at Google Scholar · View at MathSciNet
  34. G. Arminger and B. O. Muthén, “A Bayesian approach to nonlinear latent variable models using the Gibbs sampler and the Metropolis-Hastings algorithm,” Psychometrika, vol. 63, no. 3, pp. 271–300, 1998. View at Google Scholar · View at Scopus
  35. X.-Y. Song, S.-Y. Lee, and Y.-I. Hser, “Bayesian analysis of multivariate latent curve models with nonlinear longitudinal latent effects,” Structural Equation Modeling, vol. 16, no. 2, pp. 245–266, 2009. View at Publisher · View at Google Scholar · View at PubMed · View at Scopus
  36. R. J. Wirth and M. C. Edwards, “Item factor analysis: current approaches and future directions,” Psychological Methods, vol. 12, no. 1, pp. 58–79, 2007. View at Publisher · View at Google Scholar · View at PubMed · View at Scopus
  37. R. D. Bock and M. Aitkin, “Marginal maximum likelihood estimation of item parameters: application of an EM algorithm,” Psychometrika, vol. 46, no. 4, pp. 443–459, 1981. View at Publisher · View at Google Scholar · View at MathSciNet
  38. F. B. Baker and S.-H. Kim, Eds., Item Response Theory: Parameter Estimation Techniques, vol. 176 of Statistics: Textbooks and Monographs, Marcel Dekker, New York, NY, USA, 2nd edition, 2004. View at MathSciNet
  39. R. D. Bock and R. J. Mislevy, “Adaptive EAP estimation of ability in a microcomputer environment,” Applied Psychological Measurement, vol. 6, pp. 431–444, 1982. View at Google Scholar
  40. J. H. Albert, “Bayesian estimation of normal ogive item response curves using Gibbs sampling,” Journal of Educational Statistics, vol. 17, pp. 251–269, 1992. View at Google Scholar
  41. M. A. Tanner and W. H. Wong, “The calculation of posterior distributions by data augmentation,” Journal of the American Statistical Association, vol. 82, no. 398, pp. 528–550, 1987. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  42. J. H. Albert and S. Chib, “Bayesian analysis of binary and polychotomous response data,” Journal of the American Statistical Association, vol. 88, no. 422, pp. 669–679, 1993. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  43. S. K. Sahu, “Bayesian estimation and model choice in item response models,” Journal of Statistical Computation and Simulation, vol. 72, no. 3, pp. 217–232, 2002. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  44. R. J. Patz and B. W. Junker, “A straightforward approach to Markov chain Monte Carlo methods for item response models,” Journal of Educational and Behavioral Statistics, vol. 24, no. 2, pp. 146–178, 1999. View at Google Scholar · View at Scopus
  45. R. J. Patz and B. W. Junker, “Applications and extensions of MCMC in IRT: multiple item types, missing data, and rated responses,” Journal of Educational and Behavioral Statistics, vol. 24, no. 4, pp. 342–366, 1999. View at Google Scholar · View at Scopus
  46. J. A. Wollack, D. M. Bolt, A. S. Cohen, and Y.-S. Lee, “Recovery of item parameters in the nominal response model: a comparison of marginal maximum likelihood estimation and Markov chain Monte Carlo estimation,” Applied Psychological Measurement, vol. 26, no. 3, pp. 339–352, 2002. View at Publisher · View at Google Scholar · View at MathSciNet
  47. E. T. Bradlow, H. Wainer, and X. Wang, “A Bayesian random effects model for testlets,” Psychometrika, vol. 64, no. 2, pp. 153–168, 1999. View at Google Scholar · View at Scopus
  48. J.-P. Fox and C. A. W. Glas, “Bayesian estimation of a multilevel IRT model using Gibbs sampling,” Psychometrika, vol. 66, no. 2, pp. 271–288, 2001. View at Publisher · View at Google Scholar · View at MathSciNet
  49. R. Janssen, F. Tuerlinckx, M. Meulders, and P. De Boeck, “A hierarchical IRT model for criterion-referenced measurement,” Journal of Educational and Behavioral Statistics, vol. 25, no. 3, pp. 285–306, 2000. View at Google Scholar · View at Scopus
  50. Y. Takane and J. de Leeuw, “On the relationship between item response theory and factor analysis of discretized variables,” Psychometrika, vol. 52, no. 3, pp. 393–408, 1987. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  51. A. A. Béguin and C. A. W. Glas, “MCMC estimation and some model-fit analysis of multidimensional IRT models,” Psychometrika, vol. 66, no. 4, pp. 541–561, 2001. View at Publisher · View at Google Scholar · View at MathSciNet
  52. D. M. Bolt and V. F. Lall, “Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo,” Applied Psychological Measurement, vol. 27, no. 6, pp. 395–414, 2003. View at Publisher · View at Google Scholar · View at MathSciNet
  53. S. Jackman, “Multidimensional analysis of roll call data via Bayesian simulation: identification, estimation, inference, and model checking,” Political Analysis, vol. 9, no. 3, pp. 227–241, 2001. View at Google Scholar
  54. L. Yao and K. A. Boughton, “A multidimensional item response modeling approach for improving subscale proficiency estimation and classification,” Applied Psychological Measurement, vol. 31, no. 2, pp. 83–105, 2007. View at Publisher · View at Google Scholar · View at MathSciNet
  55. S.-Y. Lee and H.-T. Zhu, “Statistical analysis of nonlinear structural equation models with continuous and polytomous data,” British Journal of Mathematical and Statistical Psychology, vol. 53, no. 2, pp. 209–232, 2000. View at Google Scholar · View at Scopus
  56. J.-Q. Shi and S.-Y. Lee, “Bayesian sampling-based approach for factor analysis models with continuous and polytomous data,” British Journal of Mathematical and Statistical Psychology, vol. 51, no. 2, pp. 233–252, 1998. View at Google Scholar · View at Scopus
  57. X.-Y. Song and S.-Y. Lee, “Bayesian estimation and test for factor analysis model with continuous and polytomous data in several populations,” British Journal of Mathematical and Statistical Psychology, vol. 54, no. 2, pp. 237–259, 2001. View at Publisher · View at Google Scholar · View at Scopus
  58. X.-Y. Song and S.-Y. Lee, “Analysis of structural equation model with ignorable missing continuous and polytomous data,” Psychometrika, vol. 67, no. 2, pp. 261–288, 2002. View at Publisher · View at Google Scholar · View at MathSciNet
  59. H. Hoijtink, “Constrained latent class analysis using the Gibbs sampler and posterior predictive p-values: applications to educational testing,” Statistica Sinica, vol. 8, no. 3, pp. 691–711, 1998. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  60. H. Hoijtink and A. Notenboom, “Model based clustering of large data sets: tracing the development of spelling ability,” Psychometrika, vol. 69, no. 3, pp. 481–498, 2004. View at Publisher · View at Google Scholar · View at MathSciNet
  61. H. Chung, B. P. Flaherty, and J. L. Schafer, “Latent class logistic regression: application to marijuana use and attitudes among high school seniors,” Journal of the Royal Statistical Society. Series A, vol. 169, no. 4, pp. 723–743, 2006. View at Publisher · View at Google Scholar · View at MathSciNet
  62. M. J. H. van Onna, “Bayesian estimation and model selection in ordered latent class models for polytomous items,” Psychometrika, vol. 67, no. 4, pp. 519–538, 2002. View at Publisher · View at Google Scholar · View at MathSciNet
  63. A. A. Rupp and J. L. Templin, “Unique characteristics of diagnostic classification models: a comprehensive review of the current state-of-the-art,” Measurement: Interdisciplinary Research and Perspectives, vol. 6, no. 4, pp. 219–262, 2008. View at Publisher · View at Google Scholar
  64. J. de la Torre and J. A. Douglas, “Higher-order latent trait models for cognitive diagnosis,” Psychometrika, vol. 69, no. 3, pp. 333–353, 2004. View at Publisher · View at Google Scholar · View at MathSciNet
  65. S. M. Hartz, A Bayesian framework for the unified model for assessing cognitive abilities: blending theory with practicality, Doctoral dissertation, Department of Statistics, University of Illinois at Urbana-Champaign, Urbana, Ill, USA, 2002.
  66. R. A. Henson, J. L. Templin, and F. Porch, “Description of the underlying algorithm of the improved Arpeggio,” Unpublished ETS Project Report, Princeton, NJ, USA, 2004.
  67. J. L. Templin and R. A. Henson, “Measurement of psychological disorders using cognitive diagnosis models,” Psychological Methods, vol. 11, no. 3, pp. 287–305, 2006. View at Publisher · View at Google Scholar · View at PubMed · View at Scopus
  68. R. G. Almond, L. V. DiBello, B. Moulder, and J.-D. Zapata-Rivera, “Modeling diagnostic assessments with Bayesian networks,” Journal of Educational Measurement, vol. 44, no. 4, pp. 341–359, 2007. View at Publisher · View at Google Scholar · View at Scopus
  69. R. Levy and R. J. Mislevy, “Specifying and refining a measurement model for a simulation-based assessment,” International Journal of Measurement, vol. 4, pp. 333–369, 2004. View at Google Scholar
  70. A. A. Rupp, “Feature selection for choosing and assembling measurement models: a building-block-based organization,” International Journal of Testing, vol. 2, no. 3-4, pp. 311–360, 2002. View at Publisher · View at Google Scholar
  71. D. M. Bolt, A. S. Cohen, and J. A. Wollack, “A mixture item response model for multiple-choice data,” Journal of Educational and Behavioral Statistics, vol. 26, no. 4, pp. 381–409, 2001. View at Google Scholar · View at Scopus
  72. A. S. Cohen and D. M. Bolt, “A mixture model analysis of differential item functioning,” Journal of Educational Measurement, vol. 42, no. 2, pp. 133–148, 2005. View at Publisher · View at Google Scholar · View at Scopus
  73. K. A. Bollen, “Latent variables in psychology and the social sciences,” Annual Review of Psychology, vol. 53, pp. 605–634, 2002. View at Publisher · View at Google Scholar · View at PubMed · View at Scopus
  74. S. Jackman, “Estimation and inference are missing data problems: unifying social science statistics via Bayesian simulation,” Political Analysis, vol. 8, pp. 307–332, 2000. View at Google Scholar
  75. L. T. Mariano and B. W. Junker, “Covariates of the rating process in hierarchical models for multiple ratings of test items,” Journal of Educational and Behavioral Statistics, vol. 32, no. 3, pp. 287–314, 2007. View at Publisher · View at Google Scholar · View at Scopus
  76. A. E. Beaton, “The NAEP 1983-1984 technical report,” Tech. Rep., ETS, Princeton, NJ, USA, 1987. View at Google Scholar
  77. S. L. Scott and E. H. Ip, “Empirical Bayes and item-clustering effects in a latent variable hierarchical model: a case study from the National Assessment of Educational Progress,” Journal of the American Statistical Association, vol. 97, no. 458, pp. 409–419, 2002. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  78. N. T. Longford, “Model-based methods for analysis of data from the 1990 NAEP trial state assessment,” Research and Development Report 95-696, National Center for Education Statistics, Washington, DC, USA, 1995. View at Google Scholar
  79. S. W. Raudenbush, R. P. Fotiu, and Y. F. Cheong, “Synthesizing results from the trial state assessment,” Journal of Educational and Behavioral Statistics, vol. 24, no. 4, pp. 413–438, 1999. View at Google Scholar · View at Scopus
  80. M. S. Johnson and F. Jenkins, “A Bayesian hierarchical model for large-scale educational surveys: an application to the National Assessment of Educational Progress,” Research Report RR-04-38, ETS, Princeton, NJ, USA, 2005. View at Google Scholar
  81. M. S. Johnson, “A Bayesian hierarchical model for multidimensional performance assessments,” in Proceedings of the Annual Meeting of the National Council on Measurement in Education, New Orleans, La, USA, June 2002.
  82. R. J. Mislevy and R. D. Bock, BILOG 3, Scientific Software, Mooresville, Ind, USA, 2nd edition, 1990.
  83. L. K. Muthén and B. O. Muthén, Mplus User's Guide, Muthén & Muthén, Los Angeles, Calif, USA, 4th edition, 1998–2006.
  84. F. B. Baker, “An investigation of the item parameter recovery characteristics of a Gibbs sampling procedure,” Applied Psychological Measurement, vol. 22, no. 2, pp. 153–169, 1998. View at Google Scholar · View at Scopus
  85. S. Schilling and R. D. Bock, “High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature,” Psychometrika, vol. 70, no. 3, pp. 533–555, 2005. View at Google Scholar · View at MathSciNet
  86. B. O. Muthén, “Beyond SEM: general latent variable modeling,” Behaviormetrika, vol. 29, no. 1, pp. 81–117, 2002. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  87. S. Rabe-Hesketh, A. Skrondal, and A. Pickles, “Generalized multilevel structural equation modeling,” Psychometrika, vol. 69, no. 2, pp. 167–190, 2004. View at Publisher · View at Google Scholar · View at MathSciNet
  88. S. Rabe-Hesketh, A. Skrondal, and A. Pickles, GLLAMM Manual, U.C. Berkeley Division of Biostatistics Working Paper Series, 160, U.C. Berkeley Division of Biostatistics, Berkeley, Calif, USA, 2nd edition, 2004.
  89. E. Segawa, S. Emery, and S. J. Curry, “Extended generalized linear latent and mixed model,” Journal of Educational and Behavioral Statistics, vol. 33, no. 4, pp. 464–484, 2008. View at Publisher · View at Google Scholar · View at Scopus
  90. J. L. Arbuckle, Amos 7.0 User's Guide, SPSS, Chicago, Ill, USA, 2006.
  91. S. Jackman, “pscl: classes and methods for R developed in the political science computational laboratory,” Department of Political Science, Stanford University. Stanford, Calif, USA. R package version 1.03, 2008, http://pscl.stanford.edu/.
  92. A. D. Martin, K. M. Quinn, and J. H. Park, “pscl: MCMCpack. R package version 0.0-6,” 2009, http://cran.r-project.org/web/packages/MCMCpack/index.html.
  93. Y. Sheng, “A MATLAB package for Markov chain Monte Carlo with a multi-unidimensional IRT model,” Journal of Statistical Software, vol. 28, no. 10, pp. 1–20, 2008. View at Google Scholar · View at Scopus
  94. Y. Sheng, “Markov Chain Monte Carlo estimation of normal ogive IRT models in MATLAB,” Journal of Statistical Software, vol. 25, no. 8, pp. 1–15, 2008. View at Google Scholar · View at Scopus
  95. L. Yao, BMIRT: Bayesian Multivariate Item Response Theory, CTB/McGraw-Hill, Monterey, Calif, USA, 2003.
  96. P. Congdon, Bayesian Statistical Modelling, Wiley Series in Probability and Statistics, John Wiley & Sons, Chichester, UK, 2nd edition, 2006. View at MathSciNet
  97. J. Gill, Bayesian Methods: A Social and Behavioral Sciences Approach, Chapman & Hall/CRC, New York, NY, USA, 2nd edition, 2007.
  98. S. Sturtz, U. Ligges, and A. Gelman, “R2WinBUGS: a package for running WinBUGS from R,” Journal of Statistical Software, vol. 12, pp. 1–16, 2005. View at Google Scholar · View at Scopus
  99. M. Plummer, N. Best, K. Cowles, and K. Vines, “coda: output analysis and diagnostics for MCMC,” R package version 0.13-4, 2009, http://cran.r-project.org/web/packages/coda/index.html.
  100. B. J. Smith, “boa: an R package for MCMC output convergence assessment and posterior inference,” Journal of Statistical Software, vol. 21, no. 11, pp. 1–37, 2007. View at Google Scholar · View at Scopus
  101. S. Jackman, “Estimation and inference via Bayesian simulation: an introduction to Markov chain Monte Carlo,” American Journal of Political Science, vol. 44, no. 2, pp. 375–404, 2000. View at Google Scholar · View at Scopus
  102. J.-S. Kim and D. M. Bolt, “Estimating item response theory models using markov chain Monte Carlo methods: An NCME instructional module on,” Educational Measurement: Issues and Practice, vol. 26, no. 4, pp. 38–51, 2007. View at Publisher · View at Google Scholar · View at Scopus
  103. S. Sinharay, “Experiences with Markov chain Monte Carlo convergence assessment in two psychometric examples,” Journal of Educational and Behavioral Statistics, vol. 29, no. 4, pp. 461–488, 2004. View at Google Scholar · View at Scopus
  104. H. Swaminathan, R. K. Hambleton, and H. J. Rogers, “Assessing the fit of item response models,” in Handbook of Statistics, C. R. Rao and S. Sinharay, Eds., vol. 26, pp. 683–718, Elsevier/North-Holland, Amsterdam, The Netherlands, 2007. View at Google Scholar
  105. C. A. W. Glas and J. C. S. Falcón, “A comparison of item-fit statistics for the three-parameter logistic model,” Applied Psychological Measurement, vol. 27, no. 2, pp. 87–106, 2003. View at Publisher · View at Google Scholar · View at Scopus
  106. D. Sörbom, “Model modification,” Psychometrika, vol. 54, no. 3, pp. 371–384, 1989. View at Publisher · View at Google Scholar · View at MathSciNet
  107. L. Hu and P. M. Bentler, “Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives,” Structural Equation Modeling, vol. 6, no. 1, pp. 1–55, 1999. View at Google Scholar
  108. W.-H. Chen and D. Thissen, “Local dependence indexes for item pairs using item response theory,” Journal of Educational and Behavioral Statistics, vol. 22, no. 3, pp. 265–289, 1997. View at Google Scholar · View at Scopus
  109. A. Gelman, X.-L. Meng, and H. Stern, “Posterior predictive assessment of model fitness via realized discrepancies,” Statistica Sinica, vol. 6, no. 4, pp. 733–807, 1996. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  110. D. J. Spiegelhalter, N. G. Best, B. P. Carlin, and A. van der Linde, “Bayesian measures of model complexity and fit,” Journal of the Royal Statistical Society. Series B, vol. 64, no. 4, pp. 583–639, 2002. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  111. J. Fu, D. M. Bolt, and Y. Li, “Evaluating item fit for a polytomous fusion model using posterior predictive checks,” in Proceedings of the Annual Meeting of the National Council on Measurement in Education, Montréal, Canada, April 2005.
  112. R. Levy, R. J. Mislevy, and S. Sinharay, “Posterior predictive model checking for multidimensionality in item response theory,” Applied Psychological Measurement, vol. 33, no. 7, pp. 519–537, 2009. View at Publisher · View at Google Scholar · View at Scopus
  113. Y. Sheng and C. K. Wikle, “Comparing multiunidimensional and unidimensional item response theory models,” Educational and Psychological Measurement, vol. 67, no. 6, pp. 899–919, 2007. View at Publisher · View at Google Scholar · View at MathSciNet
  114. S. Sinharay, “Assessing fit of unidimensional item response theory models using a Bayesian approach,” Journal of Educational Measurement, vol. 42, no. 4, pp. 375–394, 2005. View at Publisher · View at Google Scholar · View at Scopus
  115. S. Sinharay, M. S. Johnson, and H. S. Stern, “Posterior predictive assessment of item response theory models,” Applied Psychological Measurement, vol. 30, no. 4, pp. 298–321, 2006. View at Publisher · View at Google Scholar · View at MathSciNet
  116. M. J. Bayarri and J. O. Berger, “p values for composite null models,” Journal of the American Statistical Association, vol. 95, no. 452, pp. 1127–1142, 2000. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  117. J. M. Robins, A. van der Vaart, and V. Ventura, “Asymptotic distribution of p values in composite null models,” Journal of the American Statistical Association, vol. 95, no. 452, pp. 1143–1172, 2000. View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  118. M. J. Bayarri and J. O. Berger, “Rejoinder,” Journal of the American Statistical Association, vol. 95, pp. 1168–1170, 2000. View at Google Scholar
  119. A. Gelman, “Comment: Bayesian checking of the second levels of hierarchical models,” Statistical Science, vol. 22, no. 3, pp. 349–352, 2007. View at Publisher · View at Google Scholar · View at MathSciNet
  120. V. E. Johnson, “Bayesian model assessment using pivotal quantities,” Bayesian Analysis, vol. 2, no. 4, pp. 719–733, 2007. View at Google Scholar · View at MathSciNet
  121. N. L. Hjort, F. A. Dahl, and G. H. Steinbakk, “Post-processing posterior predictive p-values,” Journal of the American Statistical Association, vol. 101, no. 475, pp. 1157–1174, 2006. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  122. A. Gelman, “A Bayesian formulation of exploratory data analysis and goodness-of-fit testing,” International Statistical Review, vol. 71, no. 2, pp. 369–382, 2003. View at Google Scholar · View at Zentralblatt MATH · View at Scopus
  123. J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, The Morgan Kaufmann Series in Representation and Reasoning, Morgan Kaufmann, San Mateo, Calif, USA, 1988. View at MathSciNet