×

Mixtures of stochastic differential equations with random effects: application to data clustering. (English) Zbl 1335.62095

Summary: We consider \(N\) independent stochastic processes \((X_i(t), t \in [0, T_i])\), \(i = 1, \ldots, N\), defined by a stochastic differential equation with drift term depending on a random variable \(\phi_i\). The distribution of the random effect \(\phi_i\) is a Gaussian mixture distribution, depending on unknown parameters which are to be estimated from the continuous observation of the processes \(X_i\). The likelihood of the observation is explicit. When the number of components is known, we prove the consistency of the exact maximum likelihood estimators and use the EM algorithm to compute it. When the number of components is unknown, BIC (Bayesian Information Criterion) is applied to select it. To assign each individual to a class, we define a classification rule based on estimated posterior probabilities. A simulation study illustrates our estimation and classification method on various models. A real data analysis is performed on growth curves with convincing results.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62F12 Asymptotic properties of parametric estimators
62P10 Applications of statistics to biology and medical sciences; meta analysis

Software:

MsdeParEst
PDFBibTeX XMLCite
Full Text: DOI HAL

References:

[1] Arribas-Gil, A.; De la Cruz, R.; Lebarbier, E.; Meza, C., Classification of longitudinal data through a semiparametric mixed-effects model based on lasso-type estimators, Biometrics, 71, 333-343 (2015) · Zbl 1390.62233
[2] Celeux, G.; Martin, O.; Lavergne, C., Mixture of linear mixed models—application to repeated data clustering, Stat. Model., 5, 243-267 (2005) · Zbl 1111.62103
[3] Comte, F.; Genon-Catalot, V.; Samson, A., Nonparametric estimation for stochastic differential equations with random effects, Stochastic Process. Appl., 123, 2522-2551 (2013) · Zbl 1284.62251
[4] Delattre, M.; Genon-Catalot, V.; Samson, A., Maximum likelihood estimation for stochastic differential equations with random effects, Scand. J. Statist., 40, 2, 322-343 (2013) · Zbl 1328.62148
[5] Dempster, A.; Laird, N.; Rubin, D., Maximum likelihood from incomplete data via the EM algorithm, J. R. Stat. Soc. Ser. B, 39, 1-38 (1977) · Zbl 0364.62022
[7] Dion, C.; Genon-Catalot, V., Bidimensional random effect estimation in mixed stochastic differential model, Stat. Inference Stoch. Process., 18 (2015) · Zbl 1342.62138
[8] Ditlevsen, S.; De Gaetano, A., Stochastic vs. deterministic uptake of dodecanedioic acid by isolated rat livers, Bull. Math. Biol., 67, 547-561 (2005) · Zbl 1334.92133
[9] Donnet, S.; Foulley, J.; Samson, A., Bayesian analysis of growth curves using mixed models defined by stochastic differential equations, Biometrics, 66, 733-741 (2010) · Zbl 1203.62187
[10] Donnet, S.; Samson, A., Parametric inference for mixed models defined by stochastic differential equations, ESAIM Probab. Stat., 12, 196-218 (2008) · Zbl 1182.62164
[11] Filipe, P.; Braumann, C.; Brites, N.; Roquete, C., Modelling animal growth in random environments: an application using nonparametric estimation, Biom. J., 52, 653-666 (2010) · Zbl 1203.62138
[12] Filipe, P.; Braumann, C.; Brites, N.; Roquete, C., Recent Development in Modeling and Applications in Statistics (2013), Springer
[14] Jacques, J.; Preda, C., Functional data clustering: a survey, Adv. Data Anal. Classif., 8, 231-255 (2014) · Zbl 1414.62018
[15] Jaffrézic, F.; Meza, C.; Lavielle, M.; Foulley, J., Genetic analysis of growth curves using the SAEM algorithm, Genet. Sel. Evol., 38, 583-600 (2006)
[16] Keribin, C., Consistent estimation of the order of mixture models, Sankhyā, 62, 49-66 (2000) · Zbl 1081.62516
[17] Leroux, B., Maximum penalized likelihood estimation for independent and Markov-dependent mixture models, Biometrics, 48, 545-558 (1992)
[18] Lipster, R.; Shiryaev, A., Statistics of Random Processes I: General Theory (2001), Springer · Zbl 1008.62072
[19] Maugis, C.; Michel, B., A non asymptotic penalized criterion for gaussian mixture model selection, ESAIM Probab. Stat., 15, 41-68 (2011) · Zbl 1395.62162
[21] Overbeck, L., Estimation for continuous branching processes, Scand. J. Statist., 25, 111-126 (1998) · Zbl 0905.62083
[22] Picchini, U.; De Gaetano, A.; Ditlevsen, S., Stochastic differential mixed-effects models, Scand. J. Stat., 37, 67-90 (2010) · Zbl 1224.62041
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.