Albert, James H.; Chib, Siddhartha Bayesian analysis of binary and polychotomous response data. (English) Zbl 0774.62031 J. Am. Stat. Assoc. 88, No. 422, 669-679 (1993). Summary: A vast literature in statistics, biometrics, and econometrics is concerned with the analysis of binary and polychotomous response data. The classical approach fits a categorical response regression model using maximum likelihood, and inferences about the model are based on the associated asymptotic theory. The accuracy of classical confidence statements is questionable for small sample sizes.In this article, exact Bayesian methods for modeling categorical response data are developed using the idea of data augmentation. The general approach can be summarized as follows. The probit regression model for binary outcomes is seen to have an underlying normal regression structure on latent continuous data. Values of the latent data can be simulated from suitable truncated normal distributions. If the latent data are known, then the posterior distribution of the parameters can be computed using standard results for normal linear models. Draws from this posterior are used to sample new latent data, and the process is iterated with Gibbs sampling.This data augmentation approach provides a general framework for analyzing binary regression models. It leads to the same simplification achieved earlier for censored regression models. Under the proposed framework, the class of probit regression models can be enlarged by using mixtures of normal distributions to model the latent data. In this normal mixture class, one can investigate the sensitivity of the parameter estimates to the choice of “link function”, which relates the linear regression estimate to the fitted probabilities. In addition, this approach allows one to easily fit Bayesian hierarchical models. One specific model considered here reflects the belief that the vector of regression coefficients lies on a smaller dimension linear subspace.The methods can also be generalized to multinomial response models with \(J>2\) categories. In the ordered multinomial model, the \(J\) categories are ordered and a model is written linking the cumulative response probabilities with the linear regression structure. In the unordered multinomial model, the latent variables have a multivariate normal distribution with unknown variance-covariance matrix. For both multinomial models, the data augmentation method combined with Gibbs sampling is outlined. This approach is especially attractive for the multivariate probit model, where calculating the likelihood can be difficult. Cited in 12 ReviewsCited in 447 Documents MSC: 62F15 Bayesian inference 62J12 Generalized linear models (logistic models) Keywords:binary probit; multinomial probit; residual analysis; Student-\(t\) link function; polychotomous response data; maximum likelihood; categorical response data; data augmentation; probit regression model; normal regression structure on latent continuous data; truncated normal distributions; posterior distribution; Gibbs sampling; binary regression models; censored regression models; mixtures of normal distributions; sensitivity; Bayesian hierarchical models; multinomial response models; ordered multinomial model; unordered multinomial model; multivariate normal; multivariate probit PDF BibTeX XML Cite \textit{J. H. Albert} and \textit{S. Chib}, J. Am. Stat. Assoc. 88, No. 422, 669--679 (1993; Zbl 0774.62031) Full Text: DOI OpenURL