Bayesian forecasting with a regime-switching zero-inflated multilevel Poisson regression model: an application to adolescent alcohol use with spatial covariates. (English) Zbl 1490.62409

Summary: In this paper, we present and evaluate a novel Bayesian regime-switching zero-inflated multilevel Poisson (RS-ZIMLP) regression model for forecasting alcohol use dynamics. The model partitions individuals’ data into two phases, known as regimes, with: (1) a zero-inflation regime that is used to accommodate high instances of zeros (non-drinking) and (2) a multilevel Poisson regression regime in which variations in individuals’ log-transformed average rates of alcohol use are captured by means of an autoregressive process with exogenous predictors and a person-specific intercept. The times at which individuals are in each regime are unknown, but may be estimated from the data. We assume that the regime indicator follows a first-order Markov process as related to exogenous predictors of interest. The forecast performance of the proposed model was evaluated using a Monte Carlo simulation study and further demonstrated using substance use and spatial covariate data from the Colorado Online Twin Study (CoTwins). Results showed that the proposed model yielded better forecast performance compared to a baseline model which predicted all cases as non-drinking and a reduced ZIMLP model without the RS structure, as indicated by higher AUC (the area under the receiver operating characteristic (ROC) curve) scores, and lower mean absolute errors (MAEs) and root-mean-square errors (RMSEs). The improvements in forecast performance were even more pronounced when we limited the comparisons to participants who showed at least one instance of transition to drinking.


62P15 Applications of statistics to psychology
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
62F15 Bayesian inference
62P10 Applications of statistics to biology and medical sciences; meta analysis
Full Text: DOI


[1] Arminger, G., Linear stochastic differential equation models for panel data with unobserved variables, Sociological Methodology, 16, 187-212 (1986)
[2] Berry, LR; West, M., Bayesian forecasting of many count-valued time series, Journal of Business and Economic Statistics, 38, 4, 872-887 (2020)
[3] Bradley, AP, The use of the area under the ROC curve in the evaluation of machine learning algorithms, Pattern Recognition, 30, 7, 1145-1159 (1997)
[4] Bronfenbrenner, U. (1992). Ecological systems theory. Jessica Kingsley Publishers.
[5] Byrnes, HF; Miller, BA; Morrison, CN; Wiebe, DJ; Remer, LG; Wiehe, SE, Brief report: Using global positioning system (GPS) enabled cell phones to examine adolescent travel patterns and time in proximity to alcohol outlets, Journal of Adolescence, 50, 65-68 (2016)
[6] Byrnes, H. F., Miller, B. A., Morrison, C. N., Wiebe, D. J., Woychik, M., & Wiehe, S. E. (2017). Association of environmental indicators with teen alcohol use and problem behavior: Teens’ observations vs. objectively-measured indicators. Health and Place,43, 151-157.
[7] Cao, H.; Li, X-L; Woon, DY-K; Ng, S-K, Integrated oversampling for imbalanced time series classification, IEEE Transactions on Knowledge and Data Engineering, 25, 12, 2809-2822 (2013)
[8] Chawla, NV; Bowyer, KW; Hall, LO; Kegelmeyer, WP, SMOTE: Synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 16, 321-357 (2002) · Zbl 0994.68128
[9] Chow, S-M, Practical tools and guidelines for exploring and fitting linear and nonlinear dynamical systems models, Multivariate Behavioral Research, 54, 5, 690-718 (2019)
[10] Chow, S-M; Witkiewitz, K.; Grasman, RPPP; Maisto, SA, The cusp catastrophe model as cross-sectional and longitudinal mixture structural equation models, Psychological Methods, 20, 142-164 (2015)
[11] Chow, S-M; Zhang, G., Nonlinear regime-switching state-space (RSSS) models, Psychometrika Application Reviews and Case Studies, 78, 4, 740-768 (2013) · Zbl 1288.62176
[12] Cudeck, R.; Browne, MW, Cross-validation of covariance structures, Multivariate Behavior Research, 18, 147-167 (1983)
[13] De Jong, P., A cross-validation filter for time series models, Biometrika, 75, 594-600 (1988) · Zbl 0651.62089
[14] Elkan, C. (2001). The foundations of cost-sensitive learning. In International joint conference on artificial intelligence (Vol .17, pp. 973-978).
[15] Ester, M., Kriegel, H.-P., Sander, J., & Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. Kdd (Vol 96, pp. 226-231).
[16] Gelfand, A. E., Dey, D. K. & Chang, H. (1992). Model determination using predictive distributions with implementation via sampling-based methods. Bayesian Statistics 4 (p. 147-159). Oxford University Press.
[17] Gelman, A.; Carlin, JB; Stern, HS; Dunson, DB; Vehtari, A.; Rubin, DB, Bayesian data analysis (2013), New York: CRC Press, New York
[18] Geman, S.; Geman, D., Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 6, 721-741 (1984) · Zbl 0573.62030
[19] Geng, Y.; Luo, X., Cost-sensitive convolutional neural networks for imbalanced time series classification, Intelligent Data Analysis, 23, 2, 357-370 (2019)
[20] Hahsler, M.; Piekenbrock, M.; Doran, D., dbscan: Fast density-based clustering with R, Journal of Statistical Software, 25, 409-416 (2019)
[21] Hall, DB, Zero-inflated Poisson and binomial regression with random effects: A case study, Biometrics, 56, 4, 1030-1039 (2000) · Zbl 1060.62535
[22] Hamilton, J. D. (1994). Time series analysis (Vol. 2). Princeton New Jersey. · Zbl 0831.62061
[23] Hanley, JA; McNeil, BJ, The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 143, 1, 29-36 (1982)
[24] Harvey, AC, Forecasting, structural time series models and the Kalman filter (2001), Cambridge: Cambridge University Press, Cambridge
[25] Helske, J., KFAS: Exponential family state space models in R, Journal of Statistical Software, 78, 10, 1-39 (2017)
[26] Howard, AL; Patrick, ME; Maggs, JL, College student affect and heavy drinking: Variable associations across days, semesters, and people, Psychology of Addictive Behaviors, 29, 2, 430 (2015)
[27] Jacobson, NC; Chow, S-M; Newman, MG, The differential time-varying effect model (DTVEM): Identifying optimal time lags in intensive longitudinal data, Behavioral Research Methods, 51, 1, 295-315 (2019)
[28] James, P.; Berrigan, D.; Hart, JE; Hipp, JA; Hoehner, CM; Kerr, J.; Laden, F., Effects of buffer size and shape on associations between the built environment and energy balance, Health and Place, 27, 162-170 (2014)
[29] Jane-Llopis, E.; Matytsina, I., Mental health and alcohol, drugs and tobacco: A review of the comorbidity between mental disorders and the use of alcohol, tobacco and illicit drugs, Drug and Alcohol Review, 25, 6, 515-536 (2006)
[30] Ji, L.; Chen, M.; Oravecz, Z.; Cummings, EM; Lu, Z-H; Chow, S-M, A Bayesian vector autoregressive model with nonignorable missingness in dependent variables and covariates: Development, evaluation, and application to family processes, Structural Equation Modeling: A Multidisciplinary Journal, 27, 3, 442-467 (2020)
[31] Kim, C.-J., & Nelson, C. R. (1999). State-space models with regime switching: classical and Gibbs-sampling approaches with applications. MIT Press Books.
[32] Kuiper, RM; Ryan, O., Drawing conclusions from cross-lagged relationships: Re-considering the role of the time-interval, Structural Equation Modeling: A Multidisciplinary Journal, 25, 5, 809-823 (2018)
[33] Kuppens, P.; Allen, NB; Sheeber, LB, Emotional inertia and psychological maladjustment, Psychological Science, 21, 7, 984-991 (2010)
[34] Lambert, D., Zero-inflated poisson regression, with an application to defects in manufacturing, Technometrics, 34, 1, 1-14 (1992) · Zbl 0850.62756
[35] Lee, AH; Wang, K.; Scott, JA; Yau, KK; McLachlan, GJ, Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros, Statistical Methods in Medical Research, 15, 1, 47-61 (2006) · Zbl 1152.62083
[36] Li, Y., Ji, L., Oravecz, Z., Brick, T. R., Hunter, M. D., & Chow, S.-M. (2019). dynr.mi: An R program for multiple imputation in dynamic modeling. International Journal of Computer Electrical Automation Control and Information Engineering. 13(5), 302-311.
[37] Li, Y.; Wood, J.; Ji, L.; Chow, S-M; Oravecz, Z., Fitting multilevel vector autoregressive models in Stan, JAGS, and Mplus, Structural Equation Modeling A Multidisciplinary Journal, 5, 1-24 (2021)
[38] Litt, MD; Cooney, NL; Morse, P., Ecological momentary assessment (EMA) with treated alcoholics: Methodological problems and potential solutions, Health Psychology, 17, 1, 48 (1998)
[39] Little, RJA; Rubin, DB, Statistical analysis with missing data (1987), New York: Wiley, New York
[40] Lu, Z-H; Chow, S-M; Ram, N.; Cole, PM, Zero-inflated regime-switching stochastic differential equation models for highly unbalanced multivariate, multi-subject time-series data, Psychometrika, 84, 2, 611-645 (2019) · Zbl 1431.62549
[41] Lu, Z-H; Chow, S-M; Sherwood, A.; Zhu, H., Bayesian analysis of ambulatory cardiovascular dynamics with application to irregularly spaced sparse data, Annals of Applied Statistics, 9, 1601-1620 (2015) · Zbl 1454.62358
[42] Lütkepohl, H., Introduction to multiple time series analysis (2005), New York: Springer-Verlag, New York · Zbl 1072.62075
[43] MacCallum, RC; Roznowski, M.; Mar, CM; Reith, JV, Alternative strategies for cross-validation of covariance structure models, Multivariate Behavioral Research, 29, 1, 1-32 (1994)
[44] Maisto, SA; Xie, FC; Witkiewitz, K.; Ewart, CK; Connors, GJ; Zhu, H.; Chow, S-M, How chronic self-regulatory stress, poor anger regulation, and momentary affect undermine treatment for alcohol use disorder: Integrating social action theory and the dynamic model of relapse, Journal of Social and Clinical Psychology, 36, 238-263 (2017)
[45] Min, Y.; Agresti, A., Random effect models for repeated measures of zero-inflated count data, Statistical Modelling, 5, 1, 1-19 (2005) · Zbl 1070.62060
[46] Moniz, N.; Branco, P.; Torgo, L., Resampling strategies for imbalanced time series forecasting, International Journal of Data Science and Analytics, 3, 3, 161-181 (2017)
[47] Neal, RM, Slice sampling, Annals of Statistics, 31, 3, 705-741 (2003)
[48] Neelon, B. H., O’Malley, A. J., & Normand, S.-L.T. (2010). A Bayesian model for repeated measures zero-inflated count data with application to outpatient psychiatric service use. Statistical Modelling,10(4), 421-439. · Zbl 07256832
[49] Oravecz, Z.; Tuerlinckx, F.; Vandekerckhove, J., A hierarchical latent stochastic differential equation model for affective dynamics, Psychological Methods, 16, 4, 468 (2011)
[50] Orrù, G.; Monaro, M.; Conversano, C.; Gemignani, A.; Sartori, G., Machine learning in psychometrics and psychological research, Frontiers in Psychology, 10, 2970 (2020)
[51] Oud, JH; Jansen, RA, Continuous time state space modeling of panel data by means of SEM, Psychometrika, 65, 2, 199-215 (2000) · Zbl 1291.62240
[52] Pasch, KE; Hearst, MO; Nelson, MC; Forsyth, A.; Lytle, LA, Alcohol outlets and youth alcohol use: Exposure in suburban areas, Health and Place, 15, 2, 642-646 (2009)
[53] Perchoux, C.; Chaix, B.; Brondeel, R.; Kestens, Y., Residential buffer, perceived neighborhood, and individual activity space: New refinements in the definition of exposure areas-the RECORD Cohort Study, Health and Place, 40, 116-122 (2016)
[54] Piironen, J.; Vehtari, A., Comparison of Bayesian predictive methods for model selection, Statistics and Computing, 27, 3, 711-735 (2017) · Zbl 06737693
[55] Plummer, M., et al. (2003). JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. In Proceedings of the 3rd international workshop on distributed statistical computing (Vol. 124, pp. 1-10).
[56] Reboussin, BA; Song, E-Y; Wolfson, M., The impact of alcohol outlet density on the geographic clustering of underage drinking behaviors within census tracts, Alcoholism Clinical and Experimental Research, 35, 8, 1541-1549 (2011)
[57] Roychoudhury, S., Ghalwash, M., & Obradovic, Z. (2017). Cost sensitive time-series classification. In Joint European conference on machine learning and knowledge discovery in databases (pp. 495-511).
[58] Russell, MA; Almeida, DM; Maggs, JL, Stressor-related drinking and future alcohol problems among university students, Psychology of Addictive Behaviors, 31, 6, 676 (2017)
[59] Russell, M. A., & Odgers, C. L. (2020). Adolescents’ subjective social status predicts day-to-day mental health and future substance use. Journal of Research on Adolescence,30, 532-544.
[60] Sánchez-Sánchez, P. A., García-González, J. R., & Coronell, L. H. P. (2019). Encountered problems of time series with neural networks: Models and architectures. IntechOpen: In Recent trends in artificial neural networks-from training to prediction.
[61] Shen, H., Exponentially weighted methods for forecasting intraday time series with multiple seasonal cycles: Comments, International Journal of Forecasting, 26, 653-654 (2010)
[62] Substance Abuse and Mental Health Services Administration, Office of Applied Studies. (2008). Results from the 2007 National Survey on Drug Use and Health: National Findings (DHHS Publication No. SMA 08-4343, NSDUH Series H-34). Rockville, MD: Substance Abuse and Mental Health Services Administration.
[63] Vehtari, A.; Gelman, A.; Gabry, J., Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC, Statistics and Computing, 27, 5, 1413-1432 (2017) · Zbl 06737719
[64] Voelkle, MC; Oud, JH; Davidov, E.; Schmidt, P., An SEM approach to continuous time modeling of panel data: Relating authoritarianism and anomia, Psychological Methods, 17, 2, 176 (2012)
[65] West, M.; Harrison, J., Bayesian forecasting and dynamic models (1997), New York: Springer-Verlag, New York · Zbl 0871.62026
[66] Wilhelm, F. H., Grossman, P., & Muller, M. I. (2012). Bridging the gap between the laboratory and the real world: Integrative ambulatory psychophysiology. In Handbook of research methods for studying daily life (pp. 210-234). Guilford: New York.
[67] Wray, TB; Merrill, JE; Monti, PM, Using ecological momentary assessment (EMA) to assess situation-level predictors of alcohol use and alcohol-related consequences, Alcohol Research: Current Reviews, 36, 1, 19 (2014)
[68] Yarkoni, T.; Westfall, J., Choosing prediction over explanation in psychology: Lessons from machine learning, Perspectives on Psychological Science, 12, 6, 1100-1122 (2017)
[69] Yau, KK; Lee, AH, Zero-inflated Poisson regression with random effects to evaluate an occupational injury prevention programme, Statistics in Medicine, 20, 19, 2907-2920 (2001)
[70] You, D.; Hunter, M.; Chen, M.; Chow, S-M, A diagnostic procedure for detecting outliers in linear state-space models, Multivariate Behavioral Research (2019)
[71] Zhou, S., Li, Y., Bodovski, Y., Chi, G., & Chow, S.-M. (2021a). GPS2space: An open-source Python library for spatial data building and spatial measure extraction. https://github.com/shuai-zhou/gps2space. doi:10.5281/zenodo.4672651.
[72] Zhou, S., Li, Y., Chi, G., Yin, J., Oravecz, Z., Bodovski, Y., ... & Chow, S. M. (2021b). GPS2space: an open-source Python library for spatial measure extraction from GPS data. Journal of Behavioral Data Science, 1(2), 127-155.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.