×

A beta-binomial mixed-effects model approach for analysing longitudinal discrete and bounded outcomes. (English) Zbl 1429.62576

Summary: Patient-reported outcomes (PROs) are currently being increasingly used as primary outcome measures in observational and experimental studies since they inform clinicians and researchers about the health-status of patients and generate data to facilitate improved care. PROs usually appear as discrete and bounded with \(U\), \(J\), or inverse \(J\) shapes, and hence, exponential family members offer inadequate distributional fits. The beta-binomial distribution has been proposed in the literature to fit PROs. However, the fact that the beta-binomial distribution does not belong to the exponential family limits its applicability in the regression model context, and classical estimation approaches are not straightforward. Moreover, PROs are usually measured in a longitudinal framework in which individuals are followed up for a certain period. Hence, each individual obtains several scores of the PRO over time, which leads to the repeated measures and defines the correlation structure in the data. In this work, we have developed and proposed an estimation procedure for the analysis of correlated discrete and bounded outcomes, particularly PROs, by a beta-binomial mixed-effects model. Additionally, we have implemented the methodology in the PROreg package in \(R\). Because there are similar approaches in the literature to address the same issue, this work also incorporates a comparison study between our proposal and alternative methodologies commonly implemented in \(R\) and shows the superior performance of our estimation procedure. This paper was motivated by the analysis of the health-status of patients with chronic obstructive pulmonary disease, where the main objective is the assessment of risk factors that may affect the evolution of the disease. The application of the proposed approach in the study leads to clinically relevant results.

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis

Software:

PROreg; HRQoL; R; pllf

References:

[1] Agresti, A. (2007). An introduction to categorical data analysis (2nd ed.). Hoboken, New Jersey: John Wiley & Sons, Inc. · Zbl 1266.62008
[2] Arostegui, I., Núñez‐Antón, V., & Quintana, J. M. (2007). Analysis of the short form‐36 SF‐36: The beta‐binomial distribution approach. Statistics in Medicine, 26, 1318-1342.
[3] Arostegui, I., Núñez‐Antón, V., & Quintana, J. M. (2012). Statistical approaches to analyse patient‐reported outcomes as response variables: An application to health‐related quality of life. Statistical Methods in Medical Research, 21, 189-214.
[4] Arostegui, I., Núñez‐Antón, V., & Quintana, J. M. (2013). On the recoding of continuous and bounded indexes to a binomial form: An application to quality‐of‐life scores. Journal of Applied Statistics, 40, 563-582. · Zbl 1514.62414
[5] Berger, J. O. (1985). Statistical decision theory and bayesian analysis. New York: Springer. · Zbl 0572.62008
[6] Bernardo, J., & Smith, A. (1994). Bayesian theory (Chapter 5, pp. 240-376). Wiley Series in Probability and Statistics. John Wiley & Sons, Inc. · Zbl 0796.62002
[7] Breslow, N. E., & Clayton, D. G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88, 9-25. · Zbl 0775.62195
[8] Buist, A. S., Vollmer, W. M., & McBurnie, M. A. (2008). Worldwide burden of COPD in high‐and low‐income countries. Part I. The burden of obstructive lung disease (BOLD) initiative. International Journal of Tuberculosis and Lung Disease, 12, 703-708.
[9] Deshpande, P. R., Rajan, S., Sudeepthi, B. L., & Nazir, C. P. A. (2011). Patient‐reported outcomes: A new era in clinical research. Perspectives in Clinical Research, 2, 137-144.
[10] Dickey, J. (1982). Encyclopedia of statistical sciences, (Vol. 1, pp. 135-145.). Hoboken, New Jersey: John Wiley & Sons, Inc. · Zbl 0552.62001
[11] Esteban, C., Arostegui, I., Aburto, M., Moraza, J., Quintana, J. M., García‐Loizaga, A., … Capelastegui, A. (2016). Chronic obstructive pulmonary disease subtypes. Transitions over time. PLoS One, 11(9), e0161710.
[12] Garratt, A. M., Schmidt, L., Mackintosh, A., & Fitzpatrick, R. (2002). Quality of life measurement: Bibliographic study of patient assessed health outcome measures. British Medical Journal, 324, 1417-1421.
[13] George, K., Iddi, S., & Molenberghs, G. (2016). The combined model: A tool for simulating correlated counts with overdispersion. Communication in Statistics‐ Simulation and Computation, 45, 2491-2510. · Zbl 1347.62081
[14] Gibbons, R. D., Hedeker, D., & DuToit, S. (2010). Analysing cognitive test data: Distributions and non‐parametric random effects. Annual Review of Clinical Psychology, 6, 79-107.
[15] Johnson, N., Kemp, A., & Kotz, S. (2005). Univariate discrete distributions(3rd ed.). Wiley Series in Probability and Statistics. Wiley. · Zbl 1092.62010
[16] Jones, P. W., Quirk, F. H., & Baveystock, C. M. (1991). The St George’s respiratory questionnaire. Respiratory Medicine, 85, 25-31.
[17] Jørgensen, B. (1984). The delta algorithm and GLIM. International Statistical Review, 52, 283-300. · Zbl 0569.62063
[18] Lee, Y., & Nelder, J. A. (1996). Hierarchical generalized linear models. Journal of the Royal Statistical Society. Series B, 58, 619-678. · Zbl 0880.62076
[19] Lee, Y., & Nelder, J. A. (2001). Hierarchical generalized linear models: A synthesis of generalized linear models, random‐effects and structured dispersions. Biometrika, 88, 987-1006. · Zbl 0995.62066
[20] Lee, Y., Nelder, J. A., & Pawitan, Y. (2006). Generalized linear models with random effects: Unified analysis via H‐likelihood. London: Chapman & Hall/CRC. · Zbl 1110.62092
[21] McCullagh, P., & Nelder, J. A. (1989). Generalized linear models (2nd ed.). Chapman & Hall. · Zbl 0744.62098
[22] McCulloch, C. E., & Searle, S. R. (2001). Generalized, linear and mixed models. Hoboken, NJ, USA: John Wiley & Sons, Inc. · Zbl 0964.62061
[23] Molenberghs, G., Verbeke, G., Demétrio, C., & Vieira, A. (2010). A family of generalized linear models for repeated measures with normal and conjugate random effects. Statistical Science, 25(3), 325-347. · Zbl 1329.62342
[24] Molenberghs, G., Verbeke, G., Iddi, S., & Demétrio, C. (2012). A combined beta and normal random‐effects model for repeated, overdispersed binary and binomial data. Journal of Multivariate Analysis, 111, 94-109. · Zbl 1294.62040
[25] Murray, C. J., & López, A. D. (1997). Alternative projections of mortality and disability by cause 1990-2020: Global burden of disease study. Lancet, 349, 1498-504.
[26] Najera‐Zuloaga, J., Lee, D.‐J., & Arostegui, I. (2017). Comparison of beta‐binomial regression model approaches to analyze health‐related quality of life data. Statistical Methods in Medical Research, 27(10), 2989-3009.
[27] Patterson, H. D., & Thompson, R. (1971). Recovery of interblock information when block sizes are unequal. Biometrika, 58, 545-554. · Zbl 0228.62046
[28] Pinheiro, J., Bates, D. S. D., Sarkar, D., & R Core Team. (2016). nlme: Linear and nonlinear mixed effects models. R package.
[29] R Core Team. (2017). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
[30] Rigby, R. A., & Stasinopoulos, D. M. (2005). Generalized additive models for location, scale and shape. Applied Statistics, 54, 507-554. · Zbl 1490.62201
[31] Ronnegard, L., Shen, X., & Alam, M. (2010). hglm: A package for fitting hierarchical generalized linear models. The R Journal, 2(2), 20-28.
[32] Royston, P. (2007). Profile likelihood for estimation and confidence intervals. The Stata Journal, 7(3), 376-387.
[33] Ruppert, D., Wand, M. P., & Carroll, R. J. (2003). Semiparametric regression. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press. · Zbl 1038.62042
[34] Stansfeld, S., Roberts, R., & Foot, S. (1997). Assessing the validity of the SF‐36 general health survey. Quality of Life Research, 6, 217-224.
[35] Stasinopoulos, D. M., & Rigby, R. A. (2007). Generalized additive models for location scale and shape (GAMLSS) in R. Journal of Statistical Software, 23, 1-56.
[36] Ware, J. E., Snow, K. K., Kosinski, M. A., & Gandek, B. (1993). SF36 Health Survey, Manual and Interpretation Guides. Boston, Massachusetts: The Health Institute, New England Medical Center.
[37] Wiklund, I. (2004). Assessment of patient‐reported outcomes in clinical trials: The example of health‐related quality of life. Fundamental and Clinical Pharmacology, 18, 351-63.
[38] Wu, L. (2009). Mixed effects models for complex data. Monographs on Statistics & Applied Probability. Boca Raton, FL: CRC Press.
[39] Wu, H., Zhang, Y., & Long, J. D. (2017). Longitudinal beta‐binomial modeling using GEE for overdispersed binomial data. Statistics in Medicine, 36, 1029-1040.
[40] Zeger, S. L., & Liang, K.‐Y. (1986). Longitudinal data analysis for discrete and continuous outcomes. Biometrics, 42, 121-130.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.