Dunson, David B.; Herring, Amy H.; Engel, Stephanie M.
Bayesian selection and clustering of polymorphisms in functionally related genes. (English) Zbl 1469.62367
J. Am. Stat. Assoc. 103, No. 482, 534-546 (2008).

Summary: In epidemiologic studies, there is often interest in assessing the relationship between polymorphisms in functionally related genes and a health outcome. For each candidate gene, single nucleotide polymorphism (SNP) data are collected at a number of locations, resulting in a large number of possible genotypes. Because instabilities can result in analyses that include all the SNPs, dimensionality is typically reduced by conducting single SNP analyses or attempting to identify haplotypes. This article proposes an alternative Bayesian approach for reducing dimensionality. A multilevel Dirichlet process prior is used for the distribution of the SNP-specific regression coefficients within genes, incorporating a variable selection-type mixture structure to allow SNPs with no effect. This structure allows simultaneous selection of important SNPs and soft clustering of SNPs having similar impact on the health outcome. The methods are illustrated using data from a study of pro- and anti-inflammatory cytokine polymorphisms and spontaneous preterm birth.

Cited in 20 Documents

MSC:
62P10 Applications of statistics to biology and medical sciences; meta analysis
92D30 Epidemiology

Keywords: Bayesian; clustering; Dirichlet process; genetic association; hierarchical regression; multiple testing; nonparametric Bayes; single nucleotide polymorphisms; sparse regression