zbMATH — the first resource for mathematics

Simulation-based regularized logistic regression. (English) Zbl 1330.62301
Summary: In this paper, we develop a simulation-based framework for regularized logistic regression, exploiting two novel results for scale mixtures of normals. By carefully choosing a hierarchical model for the likelihood by one type of mixture, and implementing regularization with another, we obtain new MCMC schemes with varying efficiency depending on the data type (binary v. binomial, say) and the desired estimator (maximum likelihood, maximum a posteriori, posterior mean). Advantages of our omnibus approach include flexibility, computational efficiency, applicability in \(p\gg n\) settings, uncertainty estimates, variable selection, and assessing the optimal degree of regularization. We compare our methodology to modern alternatives on both synthetic and real data. An R package called reglogit is available on CRAN.

62J12 Generalized linear models (logistic models)
62F15 Bayesian inference
62-04 Software, source code, etc. for problems pertaining to statistics
62J07 Ridge regression; shrinkage estimators (Lasso)
CRAN; R; reglogit
Full Text: DOI Euclid