zbMATH — the first resource for mathematics

Random partition models with regression on covariates. (English) Zbl 1191.62073
Summary: Many recent applications of nonparametric Bayesian inference use random partition models, i.e. probability models for clustering a set of experimental units. We review the popular basic constructions. We then focus on an interesting extension of such models. In many applications covariates are available that could be used to a priori inform the clustering. This leads to random clustering models indexed by covariates, i.e., regression models with the outcome being a partition of the experimental units. We discuss some alternative approaches that have been used in the recent literature to implement such models, with an emphasis on a recently proposed extension of product partition models. Several of the reviewed approaches were not originally intended as covariate-based random partition models, but can be used for such inference.
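As a rough illustration of the constructions the summary refers to, a product partition model gives a partition a prior probability proportional to a product of cohesions, one factor per cluster. The random partition induced by a Dirichlet process is a standard instance, with cohesion c(S) = M (|S| - 1)! for total mass M. A covariate-dependent extension multiplies each cohesion by a similarity term that is large when the covariates in a cluster are close together. The sketch below enumerates all partitions of a tiny set and normalizes these weights. The function names, the exponential similarity function, and the covariate values are assumptions chosen for illustration; they are not taken from the reviewed paper.

```python
from math import exp, factorial, prod


def set_partitions(items):
    """Yield every partition of `items` as a list of blocks (lists)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing block in turn ...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ... or open a new block containing only `first`.
        yield [[first]] + partition


def dp_cohesion(block, mass=1.0):
    """Cohesion of the Dirichlet-process partition: M * (|S| - 1)!."""
    return mass * factorial(len(block) - 1)


def similarity(block, x):
    """Hypothetical similarity: decays with the covariate range in the block."""
    values = [x[i] for i in block]
    return exp(-(max(values) - min(values)))


def partition_probs(n, weight):
    """Normalize the product of per-block weights over all partitions of 0..n-1."""
    parts = list(set_partitions(list(range(n))))
    raw = [prod(weight(block) for block in part) for part in parts]
    total = sum(raw)
    # Use a canonical, hashable key for each partition.
    keys = [tuple(sorted(tuple(sorted(b)) for b in part)) for part in parts]
    return {key: w / total for key, w in zip(keys, raw)}


# Prior without covariates (Dirichlet-process cohesion, M = 1).
prior = partition_probs(3, dp_cohesion)

# Covariate-informed prior: units 0 and 1 have similar covariates, unit 2 does not.
x = [0.0, 0.1, 5.0]
prior_x = partition_probs(3, lambda b: dp_cohesion(b) * similarity(b, x))

for key in sorted(prior):
    print(key, round(prior[key], 3), round(prior_x[key], 3))
```

Exhaustive enumeration is only feasible for a handful of units, since the number of partitions grows as the Bell numbers; it is used here purely to make the effect of the similarity term visible.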

MSC:
62G08 Nonparametric regression and quantile regression
62F15 Bayesian inference
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62C10 Bayesian problems; characterization of Bayes procedures
References:
[1] Antoniak, C.E., Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems, The annals of statistics, 2, 1152-1174, (1974) · Zbl 0335.60034
[2] Barry, D.; Hartigan, J.A., A Bayesian analysis for change point problems, Journal of the American statistical association, 88, 309-319, (1993) · Zbl 0775.62065
[3] Bishop, C.M., Svensén, M., 2003. Bayesian hierarchical mixtures of experts. In: Kjaerulff, U., Meek, C. (Eds.), 2003 Proceedings of Nineteenth Conference on Uncertainty in Artificial Intelligence, pp. 57-64.
[4] Chung, Y., Dunson, D.B., 2008. Nonparametric Bayes conditional distribution modeling with variable selection, Technical Report, Department of Statistical Science, Duke University. · Zbl 1205.62039
[5] Dahl, D.B., 2008. Distance-based probability distribution for set partitions with applications to Bayesian nonparametrics. In: JSM Proceedings, Section on Bayesian Statistical Science, American Statistical Association, Alexandria, VA.
[6] Dasgupta, A.; Raftery, A.E., Detecting features in spatial point processes with clutter via model-based clustering, Journal of the American statistical association, 93, 294-302, (1998) · Zbl 0906.62105
[7] Dunson, D.B.; Park, J.-H., Kernel stick-breaking processes, Biometrika, 95, 307-323, (2008) · Zbl 1437.62448
[8] Dunson, D.B.; Pillai, N.; Park, J.-H., Bayesian density regression, Journal of the royal statistical society, series B: statistical methodology, 69, 163-183, (2007) · Zbl 1120.62025
[9] Ferguson, T.S., A Bayesian analysis of some nonparametric problems, The annals of statistics, 1, 209-230, (1973) · Zbl 0255.62037
[10] Griffin, J.E.; Steel, M.F.J., Order-based dependent Dirichlet processes, Journal of the American statistical association, 101, 179-194, (2006) · Zbl 1118.62360
[11] Hartigan, J.A., Partition models, Communications in statistics, part A—theory and methods, 19, 2745-2756, (1990)
[12] Jordan, M.; Jacobs, R., Hierarchical mixtures-of-experts and the EM algorithm, Neural computation, 6, 181-214, (1994)
[13] MacEachern, S.N., 1999. Dependent Nonparametric Processes. In: ASA Proceedings of the Section on Bayesian Statistical Science, American Statistical Association, Alexandria, VA.
[14] Monni, S., Tadesse, M., 2008. A stochastic partitioning method to associate high-dimensional responses and covariates, Technical Report, Department of Biostatistics and Epidemiology, University of Pennsylvania. · Zbl 1330.62035
[15] Müller, P.; Erkanli, A.; West, M., Bayesian curve fitting using multivariate normal mixtures, Biometrika, 83, 67-79, (1996) · Zbl 0865.62029
[16] Müller, P., Quintana, F., Rosner, G., 2008. Bayesian clustering with regression, Technical Report, M.D. Anderson Cancer Center, Houston, TX.
[17] Park, J.H., Dunson, D.B., 2010. Bayesian generalized product partition model. Statistica Sinica, 20, to appear. · Zbl 05769963
[18] Pitman, J., 1996. Some Developments of the Blackwell-MacQueen Urn Scheme. In: Ferguson, T.S., Shapley, L.S., MacQueen, J.B. (Eds.), Statistics, Probability and Game Theory. Papers in Honor of David Blackwell, IMS Lecture Notes-Monograph Series, Hayward, California, pp. 245-268.
[19] Quintana, F.A., A predictive view of Bayesian clustering, Journal of statistical planning and inference, 136, 2407-2429, (2006) · Zbl 1090.62023
[20] Sethuraman, J., A constructive definition of Dirichlet priors, Statistica sinica, 4, 639-650, (1994) · Zbl 0823.62007
[21] Shahbaba, B., Neal, R.M., 2007. Nonlinear models using Dirichlet Process Mixtures, Technical Report 0707, Department of Statistics, University of Toronto. · Zbl 1235.62069
[22] Wang, E.; Ngalame, Y.; Panelli, M.C.; Nguyen-Jackson, H.; Deavers, M.; Müller, P.; Hu, W.; Savary, C.A.; Kobayashi, R.; Freedman, R.S.; Marincola, F.M., Peritoneal and subperitoneal stroma may facilitate regional spread of ovarian cancer, Clinical cancer research, 11, 113-122, (2005)
[23] Wood, S.A.; Kohn, R.; Cottet, R.; Jiang, W.; Tanner, M., Locally adaptive nonparametric binary regression, Journal of computational and graphical statistics, 17, 352-372, (2008)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.