×

The transfer principle: a tool for complete case analysis. (English) Zbl 1296.62040

Summary: This paper gives a general method for deriving limiting distributions of complete case statistics for missing data models from corresponding results for the model where all data are observed. This provides a convenient tool for obtaining the asymptotic behavior of complete case versions of established full data methods without lengthy proofs. { } The methodology is illustrated by analyzing three inference procedures for partially linear regression models with responses missing at random. We first show that complete case versions of asymptotically efficient estimators of the slope parameter for the full model are efficient, thereby solving the problem of constructing efficient estimators of the slope parameter for this model. Second, we derive an asymptotically distribution free test for fitting a normal distribution to the errors. Finally, we obtain an asymptotically distribution free test for linearity, that is, for testing that the nonparametric component of these models is a constant. This test is new both when data are fully observed and when data are missing at random.

MSC:

62E20 Asymptotic distribution theory in statistics
62G05 Nonparametric estimation
62G10 Nonparametric hypothesis testing

References:

[1] Bhattacharya, P. K. and Zhao, P.-L. (1997). Semiparametric inference in a partial linear model. Ann. Statist. 25 244-262. · Zbl 0869.62050 · doi:10.1214/aos/1034276628
[2] Cuzick, J. (1992). Efficient estimates in semiparametric additive regression models with unknown error distribution. Ann. Statist. 20 1129-1136. · Zbl 0746.62037 · doi:10.1214/aos/1176348675
[3] Efromovich, S. (2011). Nonparametric regression with responses missing at random. J. Statist. Plann. Inference 141 3744-3752. · Zbl 1235.62045 · doi:10.1016/j.jspi.2011.06.017
[4] Forrester, J., Hooper, W., Peng, H. and Schick, A. (2003). On the construction of efficient estimators in semiparametric models. Statist. Decisions 21 109-137. · Zbl 1040.62029 · doi:10.1524/stnd.21.2.109.19007
[5] González-Manteiga, W. and Pérez-González, A. (2006). Goodness-of-fit tests for linear regression models with missing response data. Canad. J. Statist. 34 149-170. · Zbl 1096.62041 · doi:10.1002/cjs.5550340111
[6] Hart, J. D. (1997). Nonparametric Smoothing and Lack-of-Fit Tests . Springer, New York. · Zbl 0886.62043
[7] Khmaladze, E. V. and Koul, H. L. (2004). Martingale transforms goodness-of-fit tests in regression models. Ann. Statist. 32 995-1034. · Zbl 1092.62052 · doi:10.1214/009053604000000274
[8] Khmaladze, E. V. and Koul, H. L. (2009). Goodness-of-fit problem for errors in nonparametric regression: Distribution free approach. Ann. Statist. 37 3165-3185. · Zbl 1369.62073 · doi:10.1214/08-AOS680
[9] Koul, H. L. (2002). Weighted Empirical Processes in Dynamic Nonlinear Models. Lecture Notes in Statistics 166 . Springer, New York. · Zbl 1007.62047 · doi:10.1007/978-1-4613-0055-7
[10] Koul, H. L. (2006). Model diagnostics via martingale transforms: A brief review. In Frontiers in Statistics (J. Fan and H. L. Koul, eds.) 183-206. Imp. Coll. Press, London. · Zbl 1121.62050 · doi:10.1142/9781860948886_0009
[11] Koul, H. L. and Ni, P. (2004). Minimum distance regression model checking. J. Statist. Plann. Inference 119 109-141. · Zbl 1032.62036 · doi:10.1016/S0378-3758(02)00415-9
[12] Li, X. (2012). Lack-of-fit testing of a regression model with response missing at random. J. Statist. Plann. Inference 142 155-170. · Zbl 1227.62027 · doi:10.1016/j.jspi.2011.07.005
[13] Little, R. J. A. and Rubin, D. B. (2002). Statistical Analysis with Missing Data , 2nd ed. Wiley, Hoboken, NJ. · Zbl 1011.62004
[14] Müller, U. U. (2009). Estimating linear functionals in nonlinear regression with responses missing at random. Ann. Statist. 37 2245-2277. · Zbl 1173.62052 · doi:10.1214/08-AOS642
[15] Müller, U. U., Schick, A. and Wefelmeyer, W. (2006). Imputing responses that are not missing. In Probability , Statistics and Modelling in Public Health (M. Nikulin, D. Commenges and C. Huber, eds.) 350-363. Springer, New York.
[16] Müller, U. U., Schick, A. and Wefelmeyer, W. (2012). Estimating the error distribution function in semiparametric additive regression models. J. Statist. Plann. Inference 142 552-566. · Zbl 1231.62069 · doi:10.1016/j.jspi.2011.08.013
[17] Schick, A. (1993). On efficient estimation in regression models. Ann. Statist. 21 1486-1521. · Zbl 0807.62035 · doi:10.1214/aos/1176349269
[18] Schick, A. (1996). Root-\(n\)-consistent and efficient estimation in semiparametric additive regression models. Statist. Probab. Lett. 30 45-51. · Zbl 0903.62035 · doi:10.1016/0167-7152(95)00200-6
[19] Stute, W., Xu, W. L. and Zhu, L. X. (2008). Model diagnosis for parametric regression in high-dimensional spaces. Biometrika 95 451-467. · Zbl 1437.62614 · doi:10.1093/biomet/asm095
[20] Sun, Z. and Wang, Q. (2009). Checking the adequacy of a general linear model with responses missing at random. J. Statist. Plann. Inference 139 3588-3604. · Zbl 1167.62037 · doi:10.1016/j.jspi.2009.04.024
[21] Sun, Z., Wang, Q. and Dai, P. (2009). Model checking for partially linear models with missing responses at random. J. Multivariate Anal. 100 636-651. · Zbl 1163.62032 · doi:10.1016/j.jmva.2008.07.002
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.