Dealing with label switching in mixture models. (English) Zbl 0957.62020

Summary. In a Bayesian analysis of finite mixture models, parameter estimation and clustering are sometimes less straightforward than might be expected. In particular, the common practice of estimating parameters by their posterior mean, and summarizing joint posterior distributions by marginal distributions, often leads to nonsensical answers. This is due to the so-called ‘label switching’ problem, which is caused by symmetry in the likelihood of the model parameters.
A frequent response to this problem is to remove the symmetry by using artificial identifiability constraints. We demonstrate that this fails in general to solve the problem, and we describe an alternative class of approaches, relabelling algorithms, which arise from attempting to minimize the posterior expected loss under a class of loss functions. We describe in detail one particularly simple and general relabelling algorithm and illustrate its success in dealing with the label switching problem on two examples.


62F15 Bayesian inference
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62C10 Bayesian problems; characterization of Bayes procedures
Full Text: DOI