×

Multi-sample comparison using spatial signs for infinite dimensional data. (English) Zbl 07603095

Summary: We consider an analysis of variance type problem, where the sample observations are random elements in an infinite dimensional space. This scenario covers the case, where the observations are random functions. For such a problem, we propose a test based on spatial signs. We develop an asymptotic implementation as well as a bootstrap implementation and a permutation implementation of this test and investigate their size and power properties. We compare the performance of our test with that of several mean based tests of analysis of variance for functional data studied in the literature. Interestingly, our test not only outperforms the mean based tests in several non-Gaussian models with heavy tails or skewed distributions, but in some Gaussian models also. Further, we also compare the performance of our test with the mean based tests in several models involving contaminated probability distributions. Finally, we demonstrate the performance of these tests in three real datasets: a Canadian weather dataset, a spectrometric dataset on chemical analysis of meat samples and a dataset on orthotic measurements on volunteers.

MSC:

62R10 Functional data analysis
62J10 Analysis of variance and covariance (ANOVA)

Software:

fdANOVA; MNM
PDFBibTeX XMLCite
Full Text: DOI arXiv Link

References:

[1] ARISTIZABAL, J.-P., GIRALDO, R. and MATEU, J. (2019). Analysis of variance for spatially correlated functional data: application to brain data. Spatial Statistics 32 100381.
[2] BHATIA, R. (2009). Notes on Functional Analysis 50. Hindustan Book Agency. · Zbl 1175.46001
[3] Billingsley, P. (2013). Convergence of Probability Measures. John Wiley & Sons.
[4] BOROVSKIKH, I. I. V. (1996). U-statistics in Banach Spaces. VSP. · Zbl 0865.60004
[5] BRUNNER, E., DETTE, H. and MUNK, A. (1997). Box-type approximations in nonparametric factorial designs. Journal of the American Statistical Association 92 1494-1502. · Zbl 0921.62096
[6] BRUNNER, E., KONIETSCHKE, F., PAULY, M. and PURI, M. L. (2017). Rank-based procedures in factorial designs: Hypotheses about non-parametric treatment effects. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 79 1463-1485. · Zbl 1458.62169
[7] Cardot, H., Cénac, P. and Zitt, P.-A. (2013). Efficient and fast estimation of the geometric median in Hilbert spaces with an averaged stochastic gradient algorithm. Bernoulli 19 18-43. · Zbl 1259.62068 · doi:10.3150/11-BEJ390
[8] CAUSEUR, D., SHEU, C.-F., PERTHAME, E. and RUFINI, F. (2019). A functional generalized F-test for signal detection with applications to event-related potentials significance analysis. Biometrics.
[9] CHAKRABORTY, A. and CHAUDHURI, P. (2014). The spatial distribution in infinite dimensional spaces and related quantiles and depths. The Annals of Statistics 42 1203-1231. · Zbl 1305.62141
[10] CHAKRABORTY, A. and CHAUDHURI, P. (2015). A Wilcoxon-Mann-Whitney-type test for infinite-dimensional data. Biometrika 102 239-246. · Zbl 1345.62083
[11] CHAUDHURI, P. (1996). On a geometric notion of quantiles for multivariate data. Journal of the American Statistical Association 91 862-872. · Zbl 0869.62040
[12] CHOI, K. and MARDEN, J. (1997). An approach to multivariate rank tests in multivariate analysis of variance. Journal of the American Statistical Association 92 1581-1590. · Zbl 0912.62065
[13] CHOI, K. and MARDEN, J. (2002). Multivariate analysis of variance using spatial ranks. Sociological Methods & Research 30 341-366.
[14] CUESTA-ALBERTOS, J. and FEBRERO-BANDE, M. (2010). A simple multiway ANOVA for functional data. Test 19 537-557. · Zbl 1203.62122
[15] CUEVAS, A., FEBRERO, M. and FRAIMAN, R. (2004). An anova test for functional data. Computational Statistics & Data Analysis 47 111-122. · Zbl 1429.62726
[16] DELAIGLE, A., HALL, P. and JIN, J. (2011). Robustness and accuracy of methods for high dimensional data analysis based on Student’s t-statistic. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 73 283-301. · Zbl 1411.62222
[17] Efron, B. and Tibshirani, R. J. (1994). An Introduction to the Bootstrap. CRC Press.
[18] GÓRECKI, T. and SMAGA, Ł. (2015). A comparison of tests for the one-way ANOVA problem for functional data. Computational Statistics 30 987-1010. · Zbl 1329.65028
[19] GÓRECKI, T. and SMAGA, Ł. (2019). fdANOVA: an R software package for analysis of variance for univariate and multivariate functional data. Computational Statistics 34 571-597. · Zbl 1417.62007
[20] GRETTON, A., BORGWARDT, K. M., RASCH, M. J., SCHÖLKOPF, B. and SMOLA, A. (2012). A Kernel Two-Sample Test. Journal of Machine Learning Research 13 723-773. · Zbl 1283.62095
[21] GUO, J. and CHEN, Y. (2019). An \[{L^2}\]-norm based ANOVA test for the equality of weakly dependent functional time series. Statistics and Its Interface 12 167-180. · Zbl 06980040
[22] HARRAR, S. W. and KONG, X. (2022). Recent developments in high-dimensional inference for multivariate data: Parametric, semiparametric and nonparametric approaches. Journal of Multivariate Analysis 188 104855. · Zbl 1493.62340
[23] HORVÁTH, L. and RICE, G. (2015). An introduction to functional data analysis and a principal component approach for testing the equality of mean curves. Revista Matemática Complutense 28 505-548. · Zbl 1347.60028
[24] JANSSEN, A. and PAULS, T. (2003). How do bootstrap and permutation tests work? The Annals of Statistics 31 768-806. · Zbl 1028.62027
[25] KOLTCHINSKII, V. I. (1997). M-estimation, convexity and quantiles. The Annals of Statistics 25 435-477. · Zbl 0878.62037
[26] KONG, X. and HARRAR, S. W. (2020). High-dimensional rank-based inference. Journal of Nonparametric Statistics 32 294-322. · Zbl 1442.62069
[27] KRUSKAL, W. H. and WALLIS, W. A. (1952). Use of ranks in one-criterion variance analysis. Journal of the American statistical Association 47 583-621. · Zbl 0048.11703
[28] KUNDU, S., MAJUMDAR, S. and MUKHERJEE, K. (2000). Central limit theorems revisited. Statistics & Probability Letters 47 265-275. · Zbl 0963.60016
[29] MÖTTÖNEN, J. and OJA, H. (1995). Multivariate spatial sign and rank methods. Journal of Nonparametric Statistics 5 201-213. · Zbl 0857.62056
[30] MÖTTÖNEN, J., OJA, H. and TIENARI, J. (1997). On the efficiency of multivariate spatial sign and rank tests. The Annals of Statistics 25 542-552. · Zbl 0873.62048
[31] OJA, H. (2010). Multivariate Nonparametric Methods with R: An Approach Based on Spatial Signs and Ranks. Springer Science & Business Media. · Zbl 1269.62036
[32] PAULY, M., BRUNNER, E. and KONIETSCHKE, F. (2015). Asymptotic permutation tests in general factorial designs. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 77 461-473. · Zbl 1414.62339
[33] SHEN, Q. and FARAWAY, J. (2004). An F test for linear models with functional responses. Statistica Sinica 14 1239-1257. · Zbl 1060.62075
[34] SHINOHARA, R. T., SHOU, H., CARONE, M., SCHULTZ, R., TUNC, B., PARKER, D., MARTIN, M. L. and VERMA, R. (2019). Distance-based analysis of variance for brain connectivity. Biometrics.
[35] ZHANG, J.-T. (2011). Statistical inferences for linear models with functional responses. Statistica Sinica 21 1431-1451. · Zbl 1236.62081
[36] ZHANG, J.-T. (2013). Analysis of Variance for Functional Data. Chapman & Hall/CRC Monographs on Statistics & Applied Probability. CRC Press.
[37] ZHANG, J.-T. and CHEN, J. (2007). Statistical inferences for functional data. The Annals of Statistics 35 1052-1079. · Zbl 1129.62029
[38] ZHANG, J.-T., CHENG, M.-Y., WU, H.-T. and ZHOU, B. (2019). A new test for functional one-way ANOVA with applications to ischemic heart screening. Computational Statistics & Data Analysis 132 3-17. · Zbl 1507.62204
[39] ZHANG, J.-T. and LIANG, X. (2014). One-way ANOVA for functional data via globalizing the pointwise F-test. Scandinavian Journal of Statistics 41 51-71. · Zbl 1349.62331
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.