×

Semiparametric mixtures of regressions with single-index for model based clustering. (English) Zbl 1474.62137

Summary: In this article, we propose two classes of semiparametric mixture regression models with single-index for model based clustering. Unlike many semiparametric/nonparametric mixture regression models that can only be applied to low dimensional predictors, the new semiparametric models can easily incorporate high dimensional predictors into the nonparametric components. The proposed models are very general, and many of the recently proposed semiparametric/nonparametric mixture regression models are indeed special cases of the new models. Backfitting estimates and the corresponding modified EM algorithms are proposed to achieve optimal convergence rates for both parametric and nonparametric parts. We establish the identifiability results of the proposed two models and investigate the asymptotic properties of the proposed estimation procedures. Simulation studies are conducted to demonstrate the finite sample performance of the proposed models. Two real data applications using the new models reveal some interesting findings.

MSC:

62G08 Nonparametric regression and quantile regression
62E20 Asymptotic distribution theory in statistics
62H30 Classification and discrimination; cluster analysis (statistical aspects)
PDFBibTeX XMLCite
Full Text: DOI arXiv

References:

[1] Cao, J.; Yao, W., Semiparametric mixture of binomial regression with a degenerate component, Statistica Sinica, 22, 27-46 (2012) · Zbl 1417.62054 · doi:10.5705/ss.2010.062
[2] Chatterjee, S.; Handcock, MS; Simmonoff, JS, A casebook for a first course in statistics and data analysis (1995), New York: Wiley, New York · Zbl 0833.62001
[3] Chen, J.; Li, P., Hypothesis test for normal mixture models: the EM approach, Ann Stat, 37, 2523-2542 (2009) · Zbl 1173.62007 · doi:10.1214/08-AOS651
[4] Cook, RD; Li, B., Dimension reduction for conditional mean in regression, Ann Stat, 30, 455-474 (2002) · Zbl 1012.62035 · doi:10.1214/aos/1021379861
[5] Fan, J.; Zhang, C.; Zhang, J., Generalized likelihood ratio statistics and Wilks phenomenon, Ann Stat, 29, 153-193 (2001) · Zbl 1029.62042 · doi:10.1214/aos/996986505
[6] Frühwirth-Schnatter, S., Markov chain Monte Carlo estimation of classical and dynamic switching and mixture models, J Am Stat Assoc, 96, 194-209 (2001) · Zbl 1015.62022 · doi:10.1198/016214501750333063
[7] Green, PJ; Richardson, S., Hidden markov models and disease mapping, J Am Stat Assoc, 97, 1055-1070 (2002) · Zbl 1046.62117 · doi:10.1198/016214502388618870
[8] Härdle, W.; Hall, P.; Ichimura, H., Optimal smoothing in single-index models, Ann Stat, 21, 157-178 (1993) · Zbl 0770.62049 · doi:10.1214/aos/1176349020
[9] Henning, C., Identifiability of models for clusterwise linear regression, J Classif, 17, 273-296 (2000) · Zbl 1017.62058 · doi:10.1007/s003570000022
[10] Hu, H.; Yao, W.; Wu, Y., The robust EM-type algorithms for log-concave mixtures of regression models, Comput Stat Data Anal, 111, 14-26 (2017) · Zbl 1464.62092 · doi:10.1016/j.csda.2017.01.004
[11] Huang, M.; Yao, W., Mixture of regression models with varying mixing proportions: a semiparametric approach, J Am Stat Assoc, 107, 711-724 (2012) · Zbl 1261.62036 · doi:10.1080/01621459.2012.682541
[12] Huang, M.; Li, R.; Wang, S., Nonparametric mixture of regression models, J Am Stat Assoc, 108, 929-941 (2013) · Zbl 06224977 · doi:10.1080/01621459.2013.772897
[13] Huang, M.; Li, R.; Wang, H.; Yao, W., Estimating mixture of Gaussian processes by kernel smoothing, J Bus Econ Stat, 32, 259-270 (2014) · doi:10.1080/07350015.2013.868084
[14] Ichimura, H., Semiparametric least squares (SLS) and weighted SLS estimation of single-index models, J Econom, 58, 71-120 (1993) · Zbl 0816.62079 · doi:10.1016/0304-4076(93)90114-K
[15] Jordan, MI; Jacobs, RA, Hierarchical mixtures of experts and the EM algorithm, Neural Comput, 6, 181-214 (1994) · doi:10.1162/neco.1994.6.2.181
[16] Li, K., Sliced inverse regression for dimension reduction, J Am Stat Assoc, 86, 414, 316-327 (1991) · Zbl 0742.62044 · doi:10.1080/01621459.1991.10475035
[17] Li, P.; Chen, J., Testing the order of a finite mixture, J Am Stat Assoc, 105, 1084-1092 (2010) · Zbl 1390.62024 · doi:10.1198/jasa.2010.tm09032
[18] Li, B.; Zha, H.; Chiaromonte, F., Contour regression: a general approach to dimension reduction, Ann Stat, 33, 1580-1616 (2005) · Zbl 1078.62033 · doi:10.1214/009053605000000192
[19] Luo, R.; Wang, H.; Tsai, CL, Contour projected dimension reduction, Ann Stat, 37, 3743-3778 (2009) · Zbl 1360.62184 · doi:10.1214/08-AOS679
[20] Ma, Y.; Zhu, L., A semiparametric approach to dimension reduction, J Am Stat Assoc, 107, 497, 168-179 (2012) · Zbl 1261.62037 · doi:10.1080/01621459.2011.646925
[21] Ma, Y.; Zhu, L., Efficient estimation in sufficient dimension reduction, Ann Stat, 41, 250-268 (2013) · Zbl 1347.62089 · doi:10.1214/12-AOS1072
[22] Shao, J., Linear models selection by cross-validation, J Am Stat Assoc, 88, 486-494 (1993) · Zbl 0773.62051 · doi:10.1080/01621459.1993.10476299
[23] Stephens, M., Dealing with label switching in mixture models, J R Stat Soc B, 62, 795-809 (2000) · Zbl 0957.62020 · doi:10.1111/1467-9868.00265
[24] Titterington, D.; Smith, A.; Makov, U., Statistical analysis of finite mixture distribution (1985), New York: Wiley, New York · Zbl 0646.62013
[25] Wang, H.; Xia, Y., Sliced regression for dimension reduction, J Am Stat Assoc, 103, 811-821 (2008) · Zbl 1306.62168 · doi:10.1198/016214508000000418
[26] Wang, Q.; Yao, W., An adaptive estimation of MAVE, J Multivar Anal, 104, 88-100 (2012) · Zbl 1352.62060 · doi:10.1016/j.jmva.2011.07.001
[27] Wang, S.; Yao, W.; Huang, M., A note on the identiability of nonparametric and semiparametric mixtures of GLMs, Stat Probab Lett, 93, 41-45 (2014) · Zbl 1400.62159 · doi:10.1016/j.spl.2014.06.010
[28] Wedel, M.; DeSarbo, WS, A latent class binomial logit methodology for the analysis of paired comparison data, Decis Sci, 24, 1157-1170 (1993) · doi:10.1111/j.1540-5915.1993.tb00508.x
[29] Xiang, S.; Yao, W., Semiparametric mixtures of nonparametric regressions, Ann Inst Stat Math, 70, 131-154 (2018) · Zbl 1385.62014 · doi:10.1007/s10463-016-0584-7
[30] Xiang, S.; Yao, W.; Yang, G., An overview of semiparametric extensions of finite mixture models, Stat Sci, 34, 391-404 (2019) · Zbl 1429.62272 · doi:10.1214/19-STS698
[31] Yao, W.; Lindsay, BG, Bayesian mixture labeling by highest posterior density, J Am Stat Assoc, 104, 758-767 (2009) · Zbl 1388.62007 · doi:10.1198/jasa.2009.0237
[32] Yao, W.; Nandy, D.; Lindsay, B.; Chiaromonte, F., Covariate information matrix for sufficient dimension reduction, J Am Stat Assoc, 114, 1752-1764 (2019) · Zbl 1428.62341 · doi:10.1080/01621459.2018.1515080
[33] Young, DS; Hunter, DR, Mixtures of regressions with predictors dependent mixing proportions, Comput Stat Data Anal, 54, 2253-2266 (2010) · Zbl 1284.62467 · doi:10.1016/j.csda.2010.04.002
[34] Zeng, P., Finite mixture of heteroscedastic single-index models, Open J Stat, 2, 12-20 (2012) · doi:10.4236/ojs.2012.21002
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.