Power analysis for the bootstrap likelihood ratio test for the number of classes in latent class models. (English) Zbl 1414.62061

Summary: Latent class (LC) analysis is used to construct empirical evidence on the existence of latent subgroups based on the associations among a set of observed discrete variables. One of the tests used to infer about the number of underlying subgroups is the bootstrap likelihood ratio test (BLRT). Although power analysis is rarely conducted for this test, it is important to identify, clarify, and specify the design issues that influence the statistical inference on the number of latent classes based on the BLRT. This paper proposes a computationally efficient ‘short-cut’ method to evaluate the power of the BLRT, as well as presents a procedure to determine a required sample size to attain a specific power level. Results of our numerical study showed that this short-cut method yields reliable estimates of the power of the BLRT. The numerical study also showed that the sample size required to achieve a specified power level depends on various factors of which the class separation plays a dominant role. In some situations, a sample size of 200 may be enough, while in others 2000 or more subjects are required to achieve the required power.


62F03 Parametric hypothesis testing
62F05 Asymptotic properties of parametric tests
62H30 Classification and discrimination; cluster analysis (statistical aspects)


Latent GOLD; Mplus
Full Text: DOI


[1] Akaike, H., A new look at the statistical model identification, IEEE Trans Autom Control, 19, 716-723, (1974) · Zbl 0314.62039
[2] Bock, HH, Probabilistic models in cluster analysis, Comput Stat Data Anal, 23, 6-28, (1996) · Zbl 0900.62324
[3] Cohen J (1988) Statistical power analysis for the behavioral sciences. Lawrence Erlbaum, New Jersey · Zbl 0747.62110
[4] Collins LM, Lanza ST (2010) Latent class and latent transition analysis: with applications in the social, behavioral, and health sciences. Wiley, New Jersey
[5] Davidson, R.; MacKinnon, JG, The power of bootstrap and asymptotic tests, J Econom, 133, 421-441, (2006) · Zbl 1345.62058
[6] Dias, JG; Vermunt, JK, Latent class modeling of website users’ search patterns: implications for online market segmentation, J Retail Consum Serv, 14, 359-368, (2007)
[7] Everitt, BS, A Monte Carlo investigation of the likelihood ratio test for the number of components in a mixture of normal distributions, Multivar Behav Res, 16, 171-180, (1981)
[8] Genge, E., A latent class analysis of the public attitude towards the euro adoption in Poland, Adv Data Anal Classif, 8, 427-442, (2014)
[9] Hartigan, JA; Ryzin, JV (ed.), Distribution problems in clustering, 45-72, (1977), New York
[10] Holt, JA; Macready, GB, A simulation study of the difference Chi-square statistic for comparing latent class models under violation of regularity conditions, Appl Psychol Meas, 13, 221-231, (1989)
[11] Jeffries, NO, A note on ’testing the number of components in a normal mixture, Biometrika, 90, 991-994, (2003) · Zbl 1436.62073
[12] Johnson, VE; Rossell, D., On the use of non-local prior densities in Bayesian hypothesis tests, J R Stat Soc, 27, 143-170, (2010)
[13] Langeheine, R.; Pannekoek, J.; Pol, F., Bootstrapping goodness-of-fit measures in categorical data analysis, Sociol Methods Res, 24, 492-616, (1996)
[14] Lazarsfeld PF, Henry NW (1968) Latent Structure Analysis. Houghton Mifflin, Boston · Zbl 0182.52201
[15] Leask, SJ; Vermunt, JK; Done, DJ; Crowd, TJ; Blows, M.; Boks, MP, Beyond symptom dimensions: Schizophrenia risk factors for patient groups derived by latent class analysis, Schizophr Res, 115, 346-350, (2009)
[16] Lo, YT; Mendell, NR; Rubin, DB, Testing the number of components in a normal mixture, Biometrika, 88, 767-778, (2001) · Zbl 0985.62019
[17] Magidson, J.; Vermunt, JK; Kaplan, D. (ed.), Latent class models, 175-198, (2004), Thousand Oakes
[18] McLachlan, G., On bootstrapping the likelihood ratio test statistic for the number of components in a normal mixture, Appl Stat J R Stat Soc, 36, 318-324, (1987)
[19] McLachlan G, Basford K (1988) Mixture models: inference and applications to clustering. Marcel Dekker, New York · Zbl 0697.62050
[20] McLachlan G, Peel D (2000) Finite mixture models. Wiley, New York · Zbl 0963.62061
[21] Muthén LK, Muthén BO (1998-2010) Mplus User’s Guide. Sixth Edition, Muthén & Muthén, Los Angeles, CA
[22] Nylund, KL; Muthen, M.; Muthen, BO, Deciding on the number of classes in latent class analysis and growth mixture modeling: a monte carlo simulation study, Struct Equ Model, 14, 535-569, (2007)
[23] Oberski, D., Beyond the number of classes: separating substantive from non-substantive dependence in latent class analysis, Adv Data Anal Classif, (2015)
[24] Rindskopf D (2002) The use of latent class analysis in medical diagnosis. Proceedings of the Annual Meeting of the American Statistical Association, American Statistical Association, Alexandria VA, pp 2912-2916
[25] Rubin, DB, The Bayesian bootstrap, Ann Stat, 9, 130-134, (1981)
[26] Schwarz, G., Estimating the dimension of a model, Ann Stat, 6, 461-464, (1978) · Zbl 0379.62005
[27] Self, SG; Mauritsen, RH; Ohara, J., Power calculations for likelihood ratio tests in generalized linear models, Biometrics, 48, 31-39, (1992)
[28] Shapiro, A., Asymptotic distribution of test statistics in the analysis of moment structures under inequality constraints, Biometrika, 72, 133-144, (1985) · Zbl 0596.62019
[29] Steiger, JH; Shapiro, A.; Browne, MW, On the multivariate asymptotic distribution of sequential Chi-square statistics, Psychometrika, 50, 253-263, (1985) · Zbl 0596.62020
[30] Takane Y, van der Heijden PGM, Browne MW (2003) On likelihood ratio tests for dimensionality selection. In: Higuchi T, Iba Y, Ishiguro M (eds) Proceedings of Science of Modeling: The 30th Anniversary Meeting of the Information Criterion (AIC). Report on Research and Education 17. The Institute of Statistical Mathematics, Tokyo, pp 348-349
[31] Tekle, FB; Tan, FEE; Berger, MPF, Maximin D-optimal designs for binary longitudinal responses, Comput Stat Data Anal, 52, 5253-5262, (2008) · Zbl 1452.62584
[32] Tollenaar, N.; Mooijaart, A., Type I errors and power of the parametric bootstrap goodness-of-fit test: full and limited information, Br J Math Stat Psychol, 56, 271-288, (2003)
[33] Heijden, PGM; HitHart, H.; Dessens, JAG; Rost, J. (ed.); Langeheine, R. (ed.), A parametric bootstrap procedure to perform statistical tests in latent class analysis, 190-202, (1997), New York
[34] Vermunt, JK, Latent class modeling with covariates: two improved three-step approaches, Political Anal, 18, 450-469, (2010)
[35] Vermunt JK, Magidson J (2008) Manual for latent GOLD 4.5 syntax module. Statistical Innovations Inc, Belmont, MA
[36] Vermunt JK, Magidson J (2013) Latent GOLD 5.0 Upgrade Manual. Statistical Innovations Inc, Belmont, MA
[37] Wolfe, JH, Pattern clustering by multivariate mixture analysis, Multivar Behav Res, 5, 329-350, (1970)
[38] Zenor, MJ; Srivastava, RK, Inferring market structure with aggregate data: a latent segment logit approach, J Mark Res, 25, 369-379, (1993)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.