zbMATH — the first resource for mathematics

Directional outlyingness for multivariate functional data. (English) Zbl 06970980
Summary: The direction of outlyingness is crucial to describing the centrality of multivariate functional data. Motivated by this idea, classical depth is generalized to directional outlyingness for functional data. Theoretical properties of functional directional outlyingness are investigated and the total outlyingness can be naturally decomposed into two parts: magnitude outlyingness and shape outlyingness which represent the centrality of a curve for magnitude and shape, respectively. This decomposition serves as a visualization tool for the centrality of curves. Furthermore, an outlier detection procedure is proposed based on functional directional outlyingness. This criterion applies to both univariate and multivariate curves and simulation studies show that it outperforms competing methods. Weather and electrocardiogram data demonstrate the practical application of our proposed framework.

62 Statistics
Full Text: DOI
[1] Apanasovich, T. V.; Genton, M. G.; Sun, Y., A valid Matérn class of cross-covariance functions for multivariate random fields with any number of components, J. Amer. Statist. Assoc., 107, 180-193, (2012) · Zbl 1261.62087
[2] Arribas-Gil, A.; Romo, J., Shape outlier detection and visualization for functional data: the outliergram, Biostatistics, 15, 603-619, (2014)
[3] Chakraborty, A.; Chaudhuri, P., On data depth in infinite dimensional spaces, Ann. Inst. Statist. Math., 66, 303-324, (2014) · Zbl 1336.62123
[4] Chakraborty, A.; Chaudhuri, P., The spatial distribution in infinite dimensional spaces and related quantiles and depths, Ann. Statist., 42, 1203-1231, (2014) · Zbl 1305.62141
[5] Claeskens, G.; Hubert, M.; Slaets, L.; Vakili, K., Multivariate functional halfspace depth, J. Amer. Statist. Assoc., 109, 411-423, (2014) · Zbl 1367.62162
[6] Cuevas, A.; Febrero, M.; Fraiman, R., On the use of the bootstrap for estimating functions with functional data, Comput. Statist. Data Anal., 51, 1063-1074, (2006) · Zbl 1157.62390
[7] Cuevas, A.; Fraiman, R., On depth measures and dual statistics. A methodology for dealing with general data, J. Multivariate Anal., 100, 753-766, (2009) · Zbl 1163.62039
[8] Donoho, D. L., Breakdown properties of multivariate location estimators, (1982), Harvard University, (Ph. D. Qualifying paper)
[9] Febrero, M.; Galeano, P.; González-Manteiga, W., Outlier detection in functional data by depth measures, with application to identify abnormal nox levels, Environmetrics, 19, 331-345, (2008)
[10] Ferraty, F.; Vieu, P., Nonparametric functional data analysis: theory and practice, (2006), Springer · Zbl 1119.62046
[11] Fraiman, R.; Muniz, G., Trimmed means for functional data, TEST, 10, 419-440, (2001) · Zbl 1016.62026
[12] Genton, M. G.; Johnson, C.; Potter, K.; Stenchikov, G.; Sun, Y., Surface boxplots, Stat, 3, 1-11, (2014)
[13] Gervini, D., Detecting and handling outlying trajectories in irregularly sampled functional datasets, Ann. Appl. Stat., 3, 1758-1775, (2009) · Zbl 1184.62101
[14] Gervini, D., Outlier detection and trimmed estimation for general functional data, Statist. Sinica, 22, 1639-1660, (2012) · Zbl 1253.62019
[15] Gijbels, I.; Nagy, S., Consistency of non-integrated depths for functional data, J. Multivariate Anal., 140, 259-282, (2015) · Zbl 1327.62305
[16] Gneiting, T.; Kleiber, W.; Schlather, M., Matérn cross-covariance functions for multivariate random fields, J. Amer. Statist. Assoc., 105, 1167-1177, (2010) · Zbl 1390.62194
[17] Goldberger, A. L.; Amaral, L. A.; Glass, L.; Hausdorff, J. M.; Ivanov, P. C.; Mark, R. G.; Mietus, J. E.; Moody, G. B.; Peng, C.-K.; Stanley, H. E., Physiobank, physiotoolkit, and physionet components of a new research resource for complex physiologic signals, Circulation, 101, e215-e220, (2000)
[18] Hardin, J.; Rocke, D. M., The distribution of robust distances, J. Comput. Graph. Statist., 14, 928-946, (2005)
[19] Hong, Y.; Davis, B.; Marron, J.; Kwitt, R.; Singh, N.; Kimbell, J. S.; Pitkin, E.; Superfine, R.; Davis, S. D.; Zdanski, C. J., Statistical atlas construction via weighted functional boxplots, Med. Image Anal., 18, 684-698, (2014)
[20] Horváth, L.; Kokoszka, P., Inference for functional data with applications, (2012), Springer · Zbl 1279.62017
[21] Huang, H., Sun, Y., 2016. Total variation depth for functional data. arXiv preprint arXiv:1611.04913.
[22] Hubert, M., Raymaekers, J., Rousseeuw, P.J., Segaert, P., 2016. Finding outliers in surface data and video. arXiv preprint arXiv:1601.08133.
[23] Hubert, M.; Rousseeuw, P. J.; Segaert, P., Multivariate functional outlier detection, Stat. Methods Appl., 24, 177-202, (2015) · Zbl 1441.62124
[24] Hyndman, R. J.; Shang, H. L., Rainbow plots, bagplots, and boxplots for functional data, J. Comput. Graph. Statist., 19, 29-45, (2010)
[25] Hyndman, R. J.; Ullah, M. S., Robust forecasting of mortality and fertility rates: a functional data approach, Comput. Statist. Data Anal., 51, 4942-4956, (2007) · Zbl 1162.62434
[26] Ieva, F.; Paganoni, A. M., Depth measures for multivariate functional data, Comm. Statist. Theory Methods, 42, 1265-1276, (2013) · Zbl 1347.62093
[27] Liu, R. Y., On a notion of data depth based on random simplices, Ann. Statist., 18, 405-414, (1990) · Zbl 0701.62063
[28] Long, J.P., Huang, J.Z., 2015. A study of functional depths. arXiv preprint arXiv:1506.01332v3.
[29] López-Pintado, S.; Romo, J., Depth-based classification for functional data, (DIMACS Series in Discrete Mathematics and Theoretical Computer Science. Data Depth: Robust Multivariate Analysis, Computational Geometry and Applications, Vol. 72, (2006)), 103-120
[30] López-Pintado, S.; Romo, J., On the concept of depth for functional data, J. Amer. Statist. Assoc., 104, 718-734, (2009) · Zbl 1388.62139
[31] López-Pintado, S.; Romo, J., A half-region depth for functional data, Comput. Statist. Data Anal., 55, 1679-1695, (2011) · Zbl 1328.62029
[32] López-Pintado, S.; Sun, Y.; Lin, J. K.; Genton, M. G., Simplicial band depth for multivariate functional data, Adv. Data Anal. Classif., 8, 321-338, (2014)
[33] Mahalanobis, P. C., On the generalized distance in statistics, Proc. Natl. Inst. Sci. India, 2, 49-55, (1936) · Zbl 0015.03302
[34] Matérn, B., Spatial variation, (1960), Springer
[35] McKeague, I. W.; López-Pintado, S.; Hallin, M.; Šiman, M., Analyzing growth trajectories, J. Dev. Orig. Health Dis., 2, 322-329, (2011)
[36] Möttönen, J.; Oja, H., Multivariate spatial sign and rank methods, J. Nonparametr. Stat., 5, 201-213, (1995) · Zbl 0857.62056
[37] Myllymäki, M.; Mrkvička, T.; Grabarnik, P.; Seijo, H.; Hahn, U., Global envelope tests for spatial processes, J. R. Stat. Soc. Ser. B Stat. Methodol., 79, 381-404, (2017)
[38] Nagy, S.; Gijbels, I.; Hlubinka, D., Weak convergence of discretely observed functional data with applications, J. Multivariate Anal., 146, 46-62, (2016) · Zbl 1334.62090
[39] Nagy, S.; Gijbels, I.; Omelka, M.; Hlubinka, D., Integrated depth for functional data: statistical properties and consistency, ESAIM Probab. Stat., 20, 95-130, (2016) · Zbl 1357.62201
[40] Narisetty, N. N.; He, X., Discussion of “multivariate functional outlier detection”, Stat. Methods Appl., 24, 209-215, (2015) · Zbl 1441.62129
[41] Narisetty, N. N.; Nair, V. N., Extremal depth for functional data and applications, J. Amer. Statist. Assoc., 111, 1705-1714, (2016)
[42] Ngo, D.; Sun, Y.; Genton, M. G.; Wu, J.; Srinivasan, R.; Cramer, S. C.; Ombao, H., An exploratory data analysis of electroencephalograms using the functional boxplots approach, Front. Neuroinf., 9, 282, (2015)
[43] Nieto-Reyes, A.; Battey, H., A topologically valid definition of depth for functional data, Stat. Sci., 31, 61-79, (2016) · Zbl 1436.62720
[44] Ramsay, J. O.; Silverman, B. W., Functional data analysis, (2005), Springer · Zbl 1079.62006
[45] Rousseeuw, P. J., Multivariate estimation with high breakdown point, (Grossmann, W.; Pflug, G.; Vincze, I.; Wert, W., In Mathematical Statistics and Applications, Volume B, (1985), Reidel, Dordrecht), 283-297
[46] Serfling, R., Depth functions in nonparametric multivariate inference, (Liu, R. Y.; Serfling, R.; Souvaine, D. L., In Data Depth: Robust Multivariate Analysis, Computational Geometry and Applications, Vol. 72, (2006), American Mathematical Society), 1-16
[47] Serfling, R., Equivariance and invariance properties of multivariate quantile and related functions, and the role of standardisation, J. Nonparametr. Stat., 22, 915-936, (2010) · Zbl 1203.62103
[48] Serfling, R.; Wijesuriya, U., Depth-based nonparametric description of functional data, with emphasis on use of spatial depth, Comput. Statist. Data Anal., 105, 24-45, (2017) · Zbl 06917869
[49] Stahel, W.A., 1981. Breakdown of covariance estimators. Research Report 31, Fachgruppe für Statistik, ETH, Zürich.
[50] Sun, Y.; Genton, M. G., Functional boxplots, J. Comput. Graph. Statist., 20, 316-334, (2011)
[51] Sun, Y.; Genton, M. G., Adjusted functional boxplots for spatio-temporal data visualization and outlier detection, Environmetrics, 23, 54-64, (2012)
[52] Tukey, J.W., 1975. Mathematics and the picturing of data. In: Proceedings of the International Congress of Mathematicians, Vol. 2, pp. 523-531. · Zbl 0347.62002
[53] Vardi, Y.; Zhang, C.-H., The multivariate \(L_1\)-Median and associated data depth, Proc. Natl. Acad. Sci., 97, 1423-1426, (2000) · Zbl 1054.62067
[54] Wei, L.; Keogh, E., Semi-supervised time series classification, (Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (2006), ACM), 748-753
[55] Yu, G.; Zou, C.; Wang, Z., Outlier detection in functional observations with applications to profile monitoring, Technometrics, 54, 308-318, (2012)
[56] Zuo, Y., Projection-based depth functions and associated medians, Ann. Statist., 31, 1460-1490, (2003) · Zbl 1046.62056
[57] Zuo, Y.; Serfling, R., General notions of statistical depth function, Ann. Statist., 28, 461-482, (2000) · Zbl 1106.62334
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.