Goodness-of-fit tests for mixed model diagnostics. (English) Zbl 1041.62062

Summary: A simple goodness of fit test is proposed for checking distributional assumptions involved in a mixed linear model. An estimated critical value of the test statistic is derived, and is shown to be asymptotically correct under mild conditions. As a special case, the test may be applied to linear regression models to formally check the distribution of the errors. Finite sample performance of the proposed test is examined and compared with that of a previously proposed test by simulations.


62J20 Diagnostics, and linear inference and regression
62G10 Nonparametric hypothesis testing
62J05 Linear regression; mixed models
Full Text: DOI


[1] Arora,V.,Lahiri,P. and Mukherjee, K. (1997). Empirical Bayes estimation of finite population means from complex surveys. J. Amer. Statist. Assoc. 92 1555-1562. JSTOR: · Zbl 0912.62012 · doi:10.2307/2965426
[2] Battese,G. E.,Harter,R. M. and Fuller, W. A. (1988). An error-components model for prediction of county crop areas using survey and satellite data. J. Amer. Statist. Assoc. 83 28-36.
[3] Bhatia, R. (1997). Matrix Analysis. Springer, New York. · Zbl 0863.15001
[4] Borovkov, A. A. (1977). On the power of the 2 test with increasing number of class intervals. Theory Probab. Appl. 22 366-370. · Zbl 0497.62040 · doi:10.1137/1122040
[5] Calvin, J. A. and Sedransk, J. (1991). Bayesian and frequentist predictive inference for the patterns of care studies. J. Amer. Statist. Assoc. 86 36-48.
[6] Chernoff, H. and Lehmann, E. L. (1954). The use of maximum-likelihood estimates in 2 tests for goodness of fit. Ann. Math. Statist. 25 579-586. · Zbl 0056.37103 · doi:10.1214/aoms/1177728726
[7] Dempster, A. P. and Ryan, L. M. (1985). Weighted normal plots. J. Amer. Ststist. Assoc. 80 845- 850.
[8] Ghosh, M. and Rao, J. N. K. (1994). Small area estimation: An appraisal. Statist. Sci. 9 55-93. · Zbl 0955.62538 · doi:10.1214/ss/1177010647
[9] Hall, P. and Heyde, C. C. (1980). Martingale Limit Theory and Its Application. Academic Press, New York. · Zbl 0462.60045
[10] Gray, R. J. and Pierce, D. A. (1985). Goodness-of-fit tests for censored survival data. Ann. Statist. 13 552-563. · Zbl 0576.62029 · doi:10.1214/aos/1176349538
[11] Jiang, J. (1996). REML estimation: Asymptotic behavior and related topics. Ann. Statist. 24 255- 286. · Zbl 0853.62022 · doi:10.1214/aos/1033066209
[12] Jiang, J. (1997). Wald consistency and the method of sieves in REML estimation. Ann. Statist. 25 1781-1803. · Zbl 0890.62020 · doi:10.1214/aos/1031594742
[13] Jiang, J. (1998). Asymptotic properties of the empirical BLUP and BLUE in mixed linear models. Statist. Sinica 8 861-885. Jiang, J. (2001a). Empirical method of moments and its applications. J. Statist. Plann. Inference. To appear. Jiang, J. (2001b). Goodness-of-fit tests for mixed model diagnostics: details and extensions. Technical report, Dept. Statistics, Case Western Reserve Univ. · Zbl 0901.62038
[14] Jiang,J.,Lahiri,P. and Wu, C. (2001). A generalization of the Pearson’s 2 goodness-of-fit test with estimated cell frequencies. Sankhy\?a Ser. A. · Zbl 0995.62048
[15] Kallenberg,W. C.,Odsterhoff,J. and Schriever, B. F. (1985). The number of classes in chisquared goodness-of-fit tests. J. Amer. Statist. Assoc. 80 959-968. JSTOR: · Zbl 0582.62037 · doi:10.2307/2288561
[16] Lange, N. and Ryan, L. (1989). Assessing normality in random effects models. Ann. Statist. 17 624-642. · Zbl 0672.62081 · doi:10.1214/aos/1176347130
[17] Lehmann, E. L. (1999). Elements of Large-Sample Theory. Springer, New York. · Zbl 0914.62001
[18] Mammen, E. (1996). Empirical process of residuals for high-dimensional linear models. Ann. Statist. 24 307-335. · Zbl 0853.62042 · doi:10.1214/aos/1033066211
[19] Mann, H. B. and Wald, A. (1942). On the choice of the number of intervals in the application of the chi-squared test. Ann. Math. Statist. 13 306-317. · Zbl 0063.03772 · doi:10.1214/aoms/1177731569
[20] Moore, D. S. (1978). Chi-square tests. In Studies in Statistics (R. V. Hogg, ed.) 66-106. Math. Assoc. America, Washington, DC. · Zbl 0413.62026
[21] Neyman,J. and Scott, E. (1948). Consistent estimates based on partially consistent observations. Econometrika 16 1-32. JSTOR: · Zbl 0034.07602 · doi:10.2307/1914288
[22] Pierce,D. A. and Kopecky, K. J. (1979). Testing goodness of fit for the distribution of errors in regression models. Biometrika 66 1-5. JSTOR: · Zbl 0395.62049 · doi:10.1093/biomet/66.1.1
[23] Portnoy, S. (1984). Asymptotic behavior of M-estimators of p regression parameters when p2/n is large. I. Ann. Statist. 12 1298-1309. · Zbl 0584.62050 · doi:10.1214/aos/1176346793
[24] Prasad, N. G. N. and Rao, J. N. K. (1990). The estimation of the mean squared error of small-area estimators. J. Amer. Statist. Assoc. 85 163-171. JSTOR: · Zbl 0719.62064 · doi:10.2307/2289539
[25] Rayner, J. C. W. (1982). The choice of class probabilities and number of classes for the simple 2 goodness of fit test. Sankhy \?a Ser. B 44 28-37.
[26] Rayner, J. C. W. and Best, D. J. (1989). Smooth Tests of Goodness of Fit. Oxford Univ. Press. Searle,S. R.,Casella,G. and McCulloch, C. E. (1992) Variance Components. Wiley, New York. · Zbl 0731.62064
[27] Senatov, V. V. (1980). Uniform estimates of the rate of convergence in the multi-dimensional central limit theorem. Theory Probab. Appl. 25 745-759. · Zbl 0471.60031 · doi:10.1137/1125089
[28] Verbeke, G. and Lesaffre, E. (1996). A linear mixed-effects model with heterogeneity in the random-effects population. J. Amer. Statist. Assoc. 91 217-221. · Zbl 0870.62057 · doi:10.2307/2291398
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.