×

Penalized clustering of large-scale functional data with multiple covariates. (English) Zbl 1469.62288

Summary: We propose a penalized clustering method for large-scale data with multiple covariates through a functional data approach. In our proposed method, responses and covariates are linked together through nonparametric multivariate functions (fixed effects), which have great flexibility in modeling various function features, such as jump points, branching, and periodicity. Functional ANOVA is used to further decompose multivariate functions in a reproducing kernel Hilbert space and provide associated notions of main effect and interaction. Parsimonious random effects are used to capture various correlation structures. The mixed-effects models are nested under a general mixture model in which the heterogeneity of functional data is characterized. We propose a penalized Henderson’s likelihood approach for model fitting and design a rejection-controlled EM algorithm for the estimation. Our method selects smoothing parameters through generalized cross-validation. Furthermore, Bayesian confidence intervals are used to measure the clustering uncertainty. Simulation studies and real-data examples are presented to investigate the empirical performance of the proposed method. Open-source code is available in the R package MFDA.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62R10 Functional data analysis
62J10 Analysis of variance and covariance (ANOVA)

Software:

R; MFDA; gss; fda (R)
PDFBibTeX XMLCite
Full Text: DOI arXiv