Efficient standard error formulas of ability estimators with dichotomous item response models. (English) Zbl 1342.62189

Summary: This paper focuses on the computation of asymptotic standard errors (ASE) of ability estimators with dichotomous item response models. A general framework is considered, and ability estimators are defined from a very restricted set of assumptions and formulas. This approach encompasses most standard methods such as maximum likelihood, weighted likelihood, maximum a posteriori, and robust estimators. A general formula for the ASE is derived from the theory of M-estimation. Well-known results are found back as particular cases for the maximum and robust estimators, while new ASE proposals for the weighted likelihood and maximum a posteriori estimators are presented. These new formulas are compared to traditional ones by means of a simulation study under Rasch modeling.


62P15 Applications of statistics to psychology
62F10 Point estimation
62F15 Bayesian inference
62F35 Robustness and adaptive procedures (parametric inference)
Full Text: DOI


[1] Baker, F. B., & Kim, S.-H. (2004). Item response theory: Parameter estimation techniques. New York: Marcel Dekker. · Zbl 1054.62141
[2] Birnbaum, A; Lord, FM (ed.); Novick, MR (ed.), Some latent trait models and their use in inferring an examinee’s ability, 17-20, (1968), Reading, MA
[3] Birnbaum, A, Statistical theory for logistic mental test models with a prior distribution of ability, Journal of Mathematical Psychology, 6, 258-276, (1969) · Zbl 0175.18003
[4] Bock, R. D., & Mislevy, R. J. (1982). Adaptive EAP estimation of ability in a micro computer environment. Applied Psychological Measurement, \(6\), 431-444.
[5] Carroll, R. J., & Pederson, S. (1993). On robustness in the logistic regression model. Journal of the Royal Statistical Society: Series B, 55, 693-706. · Zbl 0794.62021
[6] Doebler, A, The problem of bias in person parameter estimation in adaptive testing, Applied Psychological Measurement, 36, 255-270, (2012)
[7] Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. New York: Erlbaum.
[8] Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Boston, MA: Kluwer. · Zbl 0534.62080
[9] Huber, PJ, Robust estimation of a location parameter, Annals of Mathematical Statistics, 35, 73-101, (1964) · Zbl 0136.39805
[10] Huber, P.J. (1967). The behavior of maximum likelihood estimates under non-standard conditions. In Proceeding of the 5th Berkeley Symposium, (vol. 1, pp. 221-233). · Zbl 0596.62114
[11] Huber, P. J. (1981). Robust statistics. New York: Wiley. · Zbl 0536.62025
[12] Koralov, L., & Sinai, Y. G. (2007). Theory of probability and random processes. New York: Springer. · Zbl 1181.60004
[13] Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum Associates.
[14] Lord, FM, Unbiased estimators of ability parameters, of their variance, and of their parallel-forms reliability, Psychometrika, 48, 233-245, (1983) · Zbl 0534.62080
[15] Lord, FM, Maximum likelihood and Bayesian parameter estimation in item response theory, Journal of Educational Measurement, 23, 157-162, (1986)
[16] Magis, D, On the asymptotic standard error of a class of robust estimators of ability in dichotomous item response models, British Journal of Mathematical and Statistical Psychology, 67, 430-450, (2014) · Zbl 1406.91370
[17] Magis, D, Accuracy of asymptotic standard errors of the maximum and weighted likelihood estimators of proficiency levels with short tests, Applied Psychological Measurement, 38, 105-121, (2014)
[18] Magis, D., & Raîche, G. (2012). Random generation of response patterns under computerized adaptive testing with the R package catR. Journal of Statistical Software, 48, 1-31.
[19] Mislevy, RJ, Bayes modal estimation in item response theory, Psychometrika, 51, 177-195, (1986) · Zbl 0596.62114
[20] Mislevy, R. J., & Bock, R. D. (1982). Biweight estimates of latent ability. Educational and Psychological Measurement, 42, 725-737. doi:10.1177/001316448204200302.
[21] Mosteller, F., & Tukey, J. (1977). Exploratory data analysis and regression. Reading, MA: Addison-Wesley.
[22] Nydick, S.W. (2013). catIrt: An R package for simulating IRT-based computerized adaptive tests. R package version 0.4-1. · Zbl 0175.18003
[23] Ogasawara, H, Asymptotic properties of the Bayes and pseudo Bayes estimators of ability in item response theory, Journal of Multivariate Analysis, 114, 359-377, (2013) · Zbl 1255.62069
[24] Ogasawara, H. (2013b). Asymptotic cumulants of the ability estimators using fallible item parameters. Journal of Multivariate Analysis, 119, 144-162. doi:10.1016/j.jmva.2013.04.008. · Zbl 1277.62271
[25] Partchev, I. (2012). irtoys: Simple interface to the estimation and plotting of IRT models. R package version 0.1.6.
[26] Patton, J. M., Cheng, Y., Yuan, K.-H., & Diao, Q. (2013). The influence of item calibration error on variable-length computerized adaptive testing. Applied Psychological Measurement, 37, 24-40. doi:10.1177/0146621612461727.
[27] R Core Team (2014). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
[28] Rao, M. M. (1984). Probability theory with applications. New York: Academic Press. · Zbl 0535.60001
[29] Reif, M. (2014). PP: Estimation of person parameters for the 1, 2, 3, 4-PL model and the GPCM. R package version 0.5.3.
[30] Schuster, C; Yuan, K-H, Robust estimation of latent ability in item response models, Journal of Educational and Behavioral Statistics, 36, 720-735, (2011)
[31] Sijisma, K., & Molenaar, I. W. (2002). Introduction to nonparametric item response theory. Thousand Oaks, CA: Sage.
[32] Stefanski, L. A., & Boos, D. D. (2002). The calculus of M-estimation. The American Statistician, 56, 29-38. doi:10.1198/000313002753631330.
[33] Wainer, H. (2000). Computerized adaptive testing: A primer (2nd ed.). Mahwah, NJ: Lawrence Erlbaum Associates.
[34] Wainer, H., & Wright, B. D. (1980). Robust estimation of ability in the Rasch model. Psychometrika, 45, 373-391. doi:10.1007/BF02293910. · Zbl 0446.62113
[35] Warm, TA, Weighted likelihood estimation of ability in item response models, Psychometrika, 54, 427-450, (1989)
[36] Warm, TA, Warm (maximum) likelihood estimates of rasch measures, Rasch Measurement Transactions, 21, 1094, (2007)
[37] Wu, M. L., Adams, R. J., & Wilson, M. R. (1997). ConQuest: Multi-aspect test software [Computer program]. Camberwell, Australia: Australian Council for Educational Research. · Zbl 1255.62069
[38] Yuan, K.-H., & Jennrich, R. I. (1998). Asymptotics of estimating equations under natural conditions. Journal of Multivariate Analysis, 65, 245-260. doi:10.1006/jmva.1997.1731. · Zbl 1126.62329
[39] Zeileis, A, Object-oriented computation of sandwich estimators, Journal of Statistical Software, 16, 1-16, (2006)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.