×

General model-free weighted envelope estimation. (English) Zbl 07650534

Summary: Envelope methodology is succinctly pitched as a class of procedures for increasing efficiency in multivariate analyses without altering traditional objectives [R. D. Cook, An introduction to envelopes. Dimension reduction for efficient estimation in multivariate statistics. Hoboken, NJ: John Wiley & Sons (2018; Zbl 1407.62014); first sentence of page 1]. This description comes with the additional caveat that efficiency gains obtained by envelope methodology are mitigated by model selection volatility to an unknown degree. Recent strides to account for model selection volatility have been made on two fronts: 1) development of a weighted envelope estimator to account for this variability directly in the context of the multivariate linear regression model; 2) development of model selection criteria that facilitate consistent dimension selection for more general settings. We unify these two directions and provide weighted envelope estimators that directly account for the variability associated with model selection and are appropriate for general multivariate estimation settings. Our weighted estimation technique provides practitioners with robust and useful variance reduction in finite samples. Theoretical and empirical justification is given for our estimators and validity of a nonparametric bootstrap procedure for estimating their asymptotic variance are established. Simulation studies and a real data analysis support our claims and demonstrate the advantage of our weighted envelope estimator when model selection variability is present.

MSC:

62-XX Statistics

Citations:

Zbl 1407.62014
PDFBibTeX XMLCite
Full Text: DOI arXiv Link

References:

[1] BUCKLAND, S. T., BURNHAM, K. P. and AUGUSTIN, N. H. (1997). Model selection: An integral part of inference. Biometrics 53 603-618. · Zbl 0885.62118
[2] BURNHAM, K. P. and ANDERSON, D. R. (2004). Multimodel Inference. Sociological and Methods Research 33 261-304.
[3] CHANG, Y. and PARK, J. Y. (2003). A sieve bootstrap for the test of a unit root. Journal of Time Series Analysis 24 379-400. · Zbl 1036.62070
[4] CLAESKENS, G. and HJORT, N. L. (2008). Model Selection and Model Averaging. Cambridge University Press, Cambridge. · Zbl 1166.62001
[5] COOK, R. D. (2018). An introduction to envelopes: dimension reduction for efficient estimation in multivariate statistics 401. John Wiley & Sons. · Zbl 1407.62014
[6] COOK, R. D., FORZANI, L. and SU, Z. (2016). A Note on Fast Envelope Estimation. Journal of Multivariate Analysis 150 42-54. · Zbl 1345.62082
[7] COOK, R. D., LI, B. and CHIAROMONTE, F. (2010). Envelope models for parsimonious and efficient multivariate linear regression. Statistica Sinica 20 927-1010. · Zbl 1259.62059
[8] COOK, R. D. and SU, Z. (2013). Scaled envelopes: scale-invariant and efficient estimation in multivariate linear regression. Biometrika 100 939-954. · Zbl 1452.62492
[9] COOK, R. D. and SU, Z. (2016). Scaled Predictor Envelopes and Partial Least Squares Regression. Technometrics 58 155-165.
[10] COOK, R. D. and ZHANG, X. (2015). Foundations for Envelope Models and Methods. Journal of the American Statistical Association 110 599-611. · Zbl 1390.62131
[11] COOK, R. D. and ZHANG, X. (2016). Algorithms for Envelope Estimation. Journal of Computational and Graphical Statistics 25 284-300. · doi:10.1080/10618600.2015.1029577
[12] COOK, R. D. and ZHANG, X. (2018). Fast envelope algorithms. Statistica Sinica 28 1179-1197. · Zbl 1394.62067
[13] DING, S. and COOK, R. D. (2018). Matrix variate regressions and envelope models. Journal of the Royal Statistical Society: Series B 80 387-408. · Zbl 06849260
[14] DING, S. and COOK, R. D. (2018). Supplement: Proofs, Technical Details and additional results for “Matrix-variate regressions and envelope models”. Journal of the Royal Statistical Society: Series B 80 387-408. · Zbl 06849260
[15] ECK, D. J. (2018). Bootstrapping for multivariate linear regression models. Statistics and Probability Letters 134 141-149. · Zbl 1463.62202
[16] ECK, D. J. and COOK, R. D. (2017). Weighted envelope estimation to handle variability in model selection. Biometrika 104 743-749. · Zbl 07072240
[17] ECK, D. J., GEYER, C. J. and COOK, R. D. (2018). Supporting Data Analysis for “Combining Envelope Methodology and Aster Models for Variance Reduction in Life History Analyses”. http://hdl.handle.net/11299/178384.1. · Zbl 1437.62196
[18] ECK, D. J., GEYER, C. J. and COOK, R. D. (2020). Combining Envelope Methodology and Aster Models for Variance Reduction in Life History Analyses. Journal of Statistical Planning and Inference 205 283-292. · Zbl 1437.62196
[19] EFRON, B. (2014). Estimation and Accuracy After Model Selection. Journal of the American Statistical Association 109:507 991-1007. · Zbl 1368.62071
[20] FARAWAY, J. (2016). faraway: Functions and Datasets for Books by Julian Faraway https://cran.r-project.org/web/packages/faraway/index.html.
[21] GEYER, C. J., WAGENIUS, S. and SHAW, R. G. (2007). Aster models for life history analysis. Biometrika 94 415-426. · Zbl 1132.62090
[22] HJORT, N. L. and CLAESKENS, G. (2003). Frequentist Model Average Estimators. Journal of the American Statistical Association 98:464 879-899. · Zbl 1047.62003
[23] LEE, M. and SU, Z. (2019). Renvlp: Computing Envelope Estimators http://cran.radicaldevelop.com/web/packages/Renvlp/Renvlp.pdf.
[24] Li, L. and Zhang, X. (2017). Parsimonious tensor response regression. Journal of the American Statistical Association 112 1131-1146.
[25] WORLD HEALTH ORGANIZATION (2011). Use of glycated haemoglobin (HbA1c) in diagnosis of diabetes mellitus: abbreviated report of a WHO consultation Technical Report, Geneva: World Health Organization.
[26] SHAW, R. G., GEYER, C. J., WAGENIUS, S., HANGELBROEK, H. and ETTERSON, J. R. (2008). Unifying life-history analyses for inference of fitness and population growth. American Naturalist 172 E35-E47.
[27] SU, Z. and COOK, R. D. (2011). Partial envelopes for efficient estimation in multivariate linear regression. Biometrika 98 133-146. · Zbl 1214.62062
[28] SU, Z. and COOK, R. D. (2012). Inner envelopes: efficient estimation in multivariate linear regression. Biometrika 99 687-702. · Zbl 1437.62619
[29] SU, Z., ZHU, G., CHEN, X. and YANG, Y. (2016). Sparse Envelope Model: Efficient Estimation and Response Variable Selection in Multivariate Linear Regression. Biometrika 103 579-593. · Zbl 1495.62056
[30] TSAGUE, G. N. (2014). On Optimal Weighting Scheme in Model Averaging. American Journal of Applied Mathematics and Statistics 2(3) 150-156.
[31] WANG, W., ZENG, J. and ZHANG, X. (2021). TRES: Tensor Regression with Envelope Structure, version 1.1.4 https://cran.r-project.org/web/packages/TRES/index.html.
[32] WILLEMS, J. P., SAUNDERS, J. T., HUNT, D. E. and SCHORLING, J. B. (1997). Prevalence of coronary heart disease risk factors among rural blacks: a community-based study. Southern Medical Journal 90 814-820.
[33] YAU, T. S. and VOLAUFOVA, J. (2019). A note on hypothesis testing in the envelope model. Communications in Statistics. DOI: 10.1080/03610918.2019.1672737. · Zbl 07552613
[34] ZENG, J., WANG, W. and ZHANG, Z. (2020). TRES: An R Package for Tensor Regression and Envelope Algorithms. Journal of Statistical Software.
[35] ZHANG, X., LEE, C. E. and SHAO, X. (2020). Envelopes in Multivariate Regression Models with Nonlinearity and Heteroscedasticity. Biometrika. In Press. · Zbl 1457.62174
[36] ZHANG, X. and MAI, Q. (2018). Model-free Envelope Dimension Selection. Electronic Journal of Statistics 12 2193-2216. · Zbl 1410.62086
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.