×

Boosting functional response models for location, scale and shape with an application to bacterial competition. (English) Zbl 07506786

Summary: We extend generalized additive models for location, scale and shape (GAMLSS) to regression with functional response. This allows us to simultaneously model point-wise mean curves, variances and other distributional parameters of the response in dependence of various scalar and functional covariate effects. In addition, the scope of distributions is extended beyond exponential families. The model is fitted via gradient boosting, which offers inherent model selection and is shown to be suitable for both complex model structures and highly auto-correlated response curves. This enables us to analyse bacterial growth in Escherichia coli in a complex interaction scenario, fruitfully extending usual growth models.

MSC:

62-XX Statistics
PDF BibTeX XML Cite
Full Text: DOI

References:

[1] Brockhaus, S, Fuest, A, Mayr, A, Greven, S (2018a) Signal regression models for loca- tion, scale and shape with an application to stock returns. Journal of the Royal Statistical Society: Series C, 67, 665-86.
[2] Brockhaus, S, Melcher, M, Leisch, F, Greven, S (2017) Boosting flexible functional regression models with a high number of functional historical effects. Statistics and Computing, 27, 913-26. · Zbl 1384.62131
[3] Brockhaus, S, Ruegamer, D (2018) FDboost: Boosting functional regression models. R package version 0.3-1.
[4] Brockhaus, S, Rügamer, D, Greven, S (2018b) Boosting functional regression models with FDboost. Journal of Statistical Software. URL https://arxiv.org/pdf/1705.10662.pdf (last accessed 6 April 2020).
[5] Brockhaus, S, Scheipl, F, Greven, S (2015) The functional linear array model. Statistical Modelling, 15, 279-300. · Zbl 07258990
[6] Bühlmann, P, Hothorn, T (2007) Boosting algorithms: Regularization, prediction and model fitting (with discussion). Statistical Science, 22, 477-505. · Zbl 1246.62163
[7] Eilers, PHC, Marx, BD (2010) Splines, knots, and penalties. Wiley Interdisciplinary Reviews: Computational Statistics, 2, 637-53.
[8] Gasser, T, Müller, H-G, Kohler, W, Molinari, L, Prader, A (1984) Nonparametric regression analysis of growth curves. The Annals of Statistics, 12, 210-29. · Zbl 0535.62088
[9] Gertheiss, J, Maier, V, Hessel, EF, Staicu, A-M (2015) Marginal functional regression models for analysing the feeding behaviour of pigs. Journal of Agricultural, Biological, and Environmental Statistics, 20, 353-70.
[10] Goldsmith, J, Scheipl, F, Huang, L, Wrobel, J, Gellar, J, Harezlak, J, McLean, MW, Swihart, B, Xiao, L, Crainiceanu, C, Reiss, PT (2018) refund: Regression with Functional Data. R package version 0.1-17.
[11] Goldsmith, J, Zipunnikov, V, Schrack, J (2015) Generalized multilevel function-on-scalar regression and principal component analysis. Biometrics, 71, 344-53. · Zbl 1390.62259
[12] Greven, S, Scheipl, F (2017) A general framework for functional regression modelling (with discussion). Statistical Modelling, 17, 1-35. · Zbl 07289474
[13] Hall, P, Mueller, H-G, Yao, F (2008) Modelling sparse generalized longitudinal observations with latent Gaussian processes. Journal of the Royal Statistical Society: Series B, 70, 703-23. · Zbl 05563365
[14] Hastie, T, Tibshirani, R (1990) Generalized Additive Models. London: Chapman & Hall. · Zbl 0747.62061
[15] Hofner, B, Hothorn, T, Kneib, T, Schmid, M (2011) A framework for unbiased model selection based on boosting. Journal of Computational and Graphical Statistics, 20, 956-71.
[16] Hofner, B, Mayr, A, Fenske, N, Schmid, M (2017) gamboostLSS: Boosting methods for GAMLSS models. R package version 2.0-0.
[17] Kneib, T, Hothorn, T, Tutz, G (2009) Variable selection and model choice in geoadditive regression models. Biometrics, 65, 626-34. · Zbl 1167.62096
[18] Li, H, Staudenmayer, J, Carroll, RJ (2014) Hierarchical functional data with mixed continuous and binary measurements. Biometrics, 70, 802-11. · Zbl 1393.62076
[19] López, S, Prieto, M, Dijkstra, J, Dhanoa, M, France, J (2004) Statistical evaluation of mathematical models for microbial growth. International Journal of Food Microbiology, 96, 289-300.
[20] Mayr, A, Fenske, N, Hofner, B, Kneib, T, Schmid, M (2012) Generalized additive models for location, scale and shape for high dimensional data: A flexible approach based on boosting. Journal of the Royal Statistical Society: Series C (Applied Statistics), 61, 403-27.
[21] Morris, JS (2015) Functional regression. Annual Review of Statistics and Its Application, 2, 321-59.
[22] Perni, S, Andrew, PW, Shama, G (2005) Estimating the maximum growth rate from microbial growth curves: Definition is everything. Food Microbiology, 22, 491-95.
[23] R Core Team (2018) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. R version 3.5.1.
[24] Ramsay, J, Silverman, BW (2005) Functional Data Analysis. New York, NY: Springer Science & Business Media. · Zbl 1079.62006
[25] Rigby, RA, Stasinopoulos, DM (2005) Generalized additive models for location, scale and shape. Journal of the Royal Statistical Society. Series C (Applied Statistics), 54, 507-54. · Zbl 05188697
[26] Scheipl, F, Gertheiss, J, Greven, S (2016) Generalized functional additive mixed models. Electronic Journal of Statistics, 10, 1455-92. · Zbl 1341.62242
[27] Scheipl, F, Staicu, A-M, Greven, S (2015) Functional additive mixed models. Journal of Computational and Graphical Statistics, 24, 477-501. · Zbl 1430.62082
[28] Staicu, A-M, Crainiceanu, CM, Reich, DS, Ruppert, D (2012) Modeling functional data with spatially heterogeneous shape characteristics. Biometrics, 68, 331-43. · Zbl 1251.62050
[29] Stasinopoulos, M, Rigby, R (2019) gamlss.dist: Distributions for generalized additive models for location scale and shape. R package version 5.1-4.
[30] Thomas, J, Mayr, A, Bischl, B, Schmid, M, Smith, A, Hofner, B (2018) Gradient boosting for distributional regression: Faster tuning and improved variable selection via noncyclical updates. Statistics and Computing, 28, 673-87. · Zbl 1384.62139
[31] van der Linde, A (2009) A Bayesian latent variable approach to functional principal components analysis with binary and count data. AStA Advances in Statistical Analysis, 93, 307-33. · Zbl 1331.62299
[32] von Bronk, B, Schaffer, SA, Götz, A, Opitz, M (2017) Effects of stochasticity and division of labor in toxin production on two-strain bacterial competition in Escherichia coli. PLoS Biology, 15, e2001457.
[33] Wang, B, Shi, JQ (2014) Generalized Gaussian process regression model for non-Gaussian functional data. Journal of the American Statistical Association, 109, 1123-33. · Zbl 1368.62101
[34] Weber, MF, Poxleitner, G, Hebisch, E, Frey, E, Opitz, M (2014) Chemical warfare and survival strategies in bacterial range expansions (online supplement). Journal of The Royal Society Interface, 11, 20140172.
[35] Wood, SN (2006) Low-rank scale-invariant tensor product smooths for generalized additive mixed models. Biometrics, 62, 1025-36. · Zbl 1116.62076
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.