Adaptive estimation in the supremum norm for semiparametric mixtures of regressions. (English) Zbl 1440.62279

A two-component semiparametric mixture of regressions model is studied \[ Y=W(\mu(X)+\epsilon_1)+(1-W)\sigma(X)\epsilon_2, \] where a random vector \(X\) has a compact support. The explanatory variable \(X\) and the response variable \(Y\) are observable while the latent variable \(W\) and error variables \(\epsilon_1\) and \(\epsilon_2\) are not. The unknown location and scaling functions \(\mu(\cdot)\) and \(\sigma(\cdot)\) partially determine the distributional relationship between \(X\) and \(Y\) along with the unknown mixing function \(p(\cdot).\) Conditionally on \(\{X=x\},\) the variable \(W\) has a Bernoulli distribution with parameter \(p(x).\) Conditionally on \(\{X=x\},\) the vectors \(\epsilon_1\) and \(\epsilon_2\) have zero-symmetric conditional pdfs \(f_x\) and \(\bar{f},\) respectively, where \(\bar{f}\) is assumed known and not to depend on \(x\), while \(f_x\) is unknown and may depend on \(x.\) The functional parameter \(\theta(x)=(p(x), \sigma(x), \mu(x), f_x)\) collects all the \(x\)-local quantities to be estimated from the data.
Based on \(n\) independent copies of the model, local M-estimator of \(\theta(\cdot)\) is constructed which converges in the sup-norm at the optimal rates over Hölder-smoothness classes. An adaptive version of the estimator is introduced based on the method in [O. V. Lepskij, Theory Probab. Appl. 36, No. 4, 682–697 (1991; Zbl 0776.62039); translation from Teor. Veroyatn. Primen. 36, No. 4, 645–659 (1991)]. Simulations investigate the finite-sample behavior of the method, and an illustration is given to a real data set from bioinformatics.


62J05 Linear regression; mixed models
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62P10 Applications of statistics to biology and medical sciences; meta analysis


Zbl 0776.62039


Full Text: DOI Euclid


[1] Bordes, L., C. Delmas, and P. Vandekerkhove (2006). Semiparametric estimation of a two-component mixture model where one component is known., Scandinavian Journal of Statistics 33 (4), 733-752. · Zbl 1164.62331
[2] Bordes, L., I. Kojadinovic, and P. Vandekerkhove (2013). Semiparametric estimation of a two-component mixture of linear regressions in which one component is known., Electronic Journal of Statistics 7, 2603-2644. · Zbl 1294.62151
[3] Bordes, L. and P. Vandekerkhove (2010). Semiparametric two-component mixture model with a known component: an asymptotically normal estimator., Mathematical Methods of Statistics 19 (1), 22-41. · Zbl 1282.62068
[4] Butucea, C., R. N. Tzoumpe, and P. Vandekerkhove (2017). Semiparametric topographical mixture models with symmetric errors., Bernoulli 23 (2), 825-862. · Zbl 1384.62211
[5] Butucea, C. and P. Vandekerkhove (2014). Semiparametric mixtures of symmetric distributions., Scandinavian Journal of Statistics 41 (1), 227-239. · Zbl 1349.62094
[6] Chen, X. and K. Kato (2019). Jackknife multiplier bootstrap: finite sample approximations to the U-process supremum with applications., Probability Theory and Related Fields , 1-67.
[7] Chernozhukov, V., D. Chetverikov, K. Kato, et al. (2014). Anti-concentration and honest, adaptive confidence bands., The Annals of Statistics 42 (5), 1787-1818. · Zbl 1305.62161
[8] Compiani, G. and Y. Kitamura (2016). Using mixtures in econometric models: a brief review and some new results., The Econometrics Journal 19 (3), C95-C127.
[9] De Veaux, R. D. (1989). Mixtures of linear regressions., Computational Statistics & Data Analysis 8 (3), 227-245. · Zbl 0726.62109
[10] Driver, B. K. (2003, June). Analysis tools with applications., Lecture Notes .
[11] Giné, E., R. Latała, and J. Zinn (2000). Exponential and moment inequalities for U-statistics. In, High Dimensional Probability II, Volume 47, pp. 13-38. Springer. · Zbl 0969.60024
[12] Golubev, F., O. Lepski, and B. Levit (2001). On adaptive estimation using the sup-norm losses., Mathematical Methods of Statistics 10, 23-37. · Zbl 1005.62033
[13] Hohmann, D. and H. Holzmann (2013). Semiparametric location mixtures with distinct components., Statistics 47 (2), 348-362. · Zbl 1440.62108
[14] Huang, M., R. Li, and S. Wang (2013). Nonparametric mixture of regression models., Journal of the American Statistical Association 108 (503), 929-941. · Zbl 06224977
[15] Huang, M. and W. Yao (2012). Mixture of regression models with varying mixing proportions: a semiparametric approach., Journal of the American Statistical Association 107 (498), 711-724. · Zbl 1261.62036
[16] Hunter, D. R. and D. S. Young (2012). Semiparametric mixtures of regressions., Journal of Nonparametric Statistics 24 (1), 19-38. · Zbl 1241.62055
[17] Lepski, O. V. and V. G. Spokoiny (1997). Optimal pointwise adaptive methods in nonparametric estimation., The Annals of Statistics 25 (6), 2512-2546. · Zbl 0894.62041
[18] Lepskii, O. (1992). Asymptotically minimax adaptive estimation. i: Upper bounds. optimally adaptive estimates., Theory of Probability & Its Applications 36 (4), 682-697.
[19] Martin-Magniette, M.-L., T. Mary-Huard, C. Bérard, and S. Robin (2008). Chipmix: mixture model of regressions for two-color chip-chip analysis., Bioinformatics 24 (16), i181-i186.
[20] Quandt, R. E. and J. B. Ramsey (1978). Estimating mixtures of normal distributions and switching regressions., Journal of the American statistical Association 73 (364), 730-738. · Zbl 0401.62024
[21] Rosenblatt, M. (1969). Conditional probability density and regression estimators., Multivariate analysis II (Proc. Second Internat. Sympos., Dayton, Ohio, 1968) 25, 25-31.
[22] Städler, N., P. Bühlmann, and S. Van De Geer (2010). \( \ell -1\)-penalization for mixture regression models., Test 19 (2), 209-256. · Zbl 1203.62128
[23] Stone, C. J. (1982). Optimal global rates of convergence for nonparametric regression., The Annals of Statistics 10 (4), 1040-1053. · Zbl 0511.62048
[24] van der Vaart, A. and J. A. Wellner (1996)., Weak convergence and empirical processes. With applications to statistics. New York, NY: Springer. · Zbl 0862.60002
[25] Vandekerkhove, P. (2013). Estimation of a semiparametric mixture of regressions model., Journal of Nonparametric Statistics 25 (1), 181-208. · Zbl 1297.62076
[26] Zhu, H.-T. and H. Zhang (2004). Hypothesis testing in mixture regression models., Journal of the Royal Statistical Society: Series B (Statistical Methodology) 66 (1), 3-16. · Zbl 1062.62033
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.