Bayesian variable selection for latent class models. (English) Zbl 1226.62022

Summary: We develop a latent class model with class probabilities that depend on subject-specific covariates. One of our major goals is to identify important predictors of latent classes. We consider methodology that allows estimation of latent classes while allowing for variable selection uncertainty. We propose a Bayesian variable selection approach and implement a stochastic search Gibbs sampler for posterior computation to obtain model-averaged estimates of quantities of interest such as marginal inclusion probabilities of predictors. Our methods are illustrated through simulation studies and application to data on weight gain during pregnancy, where it is of interest to identify important predictors of latent weight gain classes.


62F15 Bayesian inference
62P10 Applications of statistics to biology and medical sciences; meta analysis
65C60 Computational problems in statistics (MSC2010)
Full Text: DOI Link


[1] Albert, Bayesian analysis of binary and polychotomous response data, Journal of the American Statistical Association 88 pp 669– (1993) · Zbl 0774.62031
[2] Celeux, Deviance information criteria for missing data models, Bayesian Analysis 1 pp 651– (2006) · Zbl 1331.62329
[3] Clyde, Flexible empirical Bayes estimation for wavelets, Journal of the Royal Statistical Society, Series B 62 pp 681– (2000) · Zbl 0957.62006
[4] Deierlein, Dietary energy density but not glycemic load is associated with gestational weight gain, American Journal of Clinical Nutrition 88 pp 693– (2008)
[5] Devroye, Non-uniform Random Variate Generation (1986)
[6] Dunson, Bayesian inference on changes in response densities over predictor clusters, Journal of the American Statistical Association 103 pp 1508– (2008) · Zbl 1286.62092
[7] Elliott, Using a Bayesian latent growth curve model to identify trajectories of positive affect and negative events following myocardial infarction, Biostatistics 6 pp 119– (2005) · Zbl 1069.62095
[8] Escobar, Bayesian density estimation and inference using mixtures, Journal of the American Statistical Association 90 pp 577– (1995) · Zbl 0826.62021
[9] Frühwirth-Schnatter, Statistical Modelling and Regression Structures-Festschrift in Honour of Ludwig Fahrmeir pp 111– (2010)
[10] Geisser, A predictive approach to model selection (Corr: V75 p765), Journal of the American Statistical Association 74 pp 153– (1979) · Zbl 0401.62036
[11] Gelfand, Bayesian model choice: Asymptotics and exact calculations, Journal of the Royal Statistical Society, Series B 56 pp 501– (1994) · Zbl 0800.62170
[12] Gelman, Simulating normalizing constants: From importance sampling to bridge sampling to path sampling, Statistical Science 13 pp 163– (1998) · Zbl 0966.65004
[13] George, Variable selection via Gibbs sampling, Journal of the American Statistical Association 88 pp 881– (1993)
[14] Green, Reversible jump Markov chain Monte Carlo computation and Bayesian model determination, Biometrika 82 pp 711– (1995) · Zbl 0861.62023
[15] Gustafson, Bayesian multinomial regression with class-specific predictor selection, The Annals of Applied Statistics 2 pp 1478– (2008) · Zbl 1156.62018
[16] Holmes, Bayesian auxiliary variable models for binary and multinomial regression, Bayesian Analysis 1 pp 145– (2006) · Zbl 1331.62142
[17] Institute of Medicine, Nutrition During Pregnancy. Part I, Weight Gain (1990)
[18] Institute of Medicine, Weight Gain During Pregnancy: Reexamining the Guidelines (2009)
[19] Jasra, Markov chain Monte Carlo methods and the label switching problem in Bayesian mixture modeling, Statistical Science 20 pp 50– (2005) · Zbl 1100.62032
[20] Jeffreys, Theory of Probability (1961)
[21] Kinney, Fixed and random effects selection in linear and logistic models, Biometrics 63 pp 690– (2007) · Zbl 1147.62022
[22] O’Brien, Bayesian multivariate logistic regression, Biometrics 60 pp 739– (2004) · Zbl 1274.62375
[23] Oh, Bayesian inference and model selection in latent class logit models with parameter constraints: An application to market segmentation, Journal of Applied Statistics 30 pp 191– (2003) · Zbl 1121.62455
[24] Park, The Bayesian lasso, Journal of the American Statistical Association 103 pp 681– (2008) · Zbl 1330.62292
[25] Richardson, On Bayesian analysis of mixtures with an unknown number of components, Journal of the Royal Statistical Society, Series B 59 pp 731– (1997) · Zbl 0891.62020
[26] Savitz, Study design and determinants of participation in an epidemiologic study of preterm delivery, Paediatric and Perinatal Epidemiology 13 pp 114– (1999)
[27] Scaccia, Bayesian growth curves using normal mixtures with nonparametric weights, Journal of Computational and Graphical Statistics 12 pp 308– (2003)
[28] Schwarz, Estimating the dimension of a model, Annals of Statistics 6 pp 461– (1978) · Zbl 0379.62005
[29] Scott, Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem, Annals of Statistics 38 pp 2857– (2010) · Zbl 1200.62020
[30] Spiegelhalter, Bayesian measures of model complexity and fit, Journal of the Royal Statistical Society, Series B 64 pp 583– (2002) · Zbl 1067.62010
[31] Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B 58 pp 267– (1996) · Zbl 0850.62538
[32] Viswanathan, Outcomes of maternal weight gain, Evidence Report/Technology Assessment 168 pp 1– (2008)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.