×

Quadratic distances on probabilities: A unified foundation. (English) Zbl 1133.62001

Summary: This work builds a unified framework for the study of quadratic form distance measures as they are used in assessing goodness of fit of models. Many important procedures have this structure, but the theory for these methods is dispersed and incomplete. Central to the statistical analysis of these distances is the spectral decomposition of the kernel that generates the distance. We show how this determines the limiting distribution of natural goodness-of-fit tests. Additionally, we develop a new notion, the spectral degrees of freedom of the test, based on this decomposition. The degrees of freedom are easy to compute and estimate, and can be used as a guide in the construction of useful procedures in this class.

MSC:

62A01 Foundations and philosophical topics in statistics
62E20 Asymptotic distribution theory in statistics
62H05 Characterization and structure theory for multivariate probability distributions; copulas
62G10 Nonparametric hypothesis testing

Software:

ElemStatLearn

References:

[1] Axler, S., Bourdon, P. and Ramey, W. (2001). Harmonic Function Theory , 2nd ed. Springer, New York. · Zbl 0959.31001 · doi:10.1007/b97238
[2] Bhatia, R. (2003). Fourier Series . Reprint of the 1993 edition [Hindustan Book Agency, New Delhi]. Classroom Resource Materials Series. Mathematical Association of America, Washington, DC. · Zbl 0940.42001
[3] Fan, J., Zhang, C. and Zhang, J. (2001). Generalized likelihood ratio statistics and Wilks phenomenon. Ann. Statist. 29 153-193. · Zbl 1029.62042 · doi:10.1214/aos/996986505
[4] Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman’s truncation. J. Amer. Statist. Assoc. 91 674-688. JSTOR: · Zbl 0869.62032 · doi:10.2307/2291663
[5] Fan, Y. (1997). Goodness-of-fit tests for a multivariate distribution by the empirical characteristic function. J. Multivariate Anal. 62 36-63. · Zbl 0949.62044 · doi:10.1006/jmva.1997.1672
[6] Fan, Y. (1998). Goodness-of-fit tests based on kernel density estimators with fixed smoothing parameters. Econometric Theory 14 604-621. JSTOR: · doi:10.1017/S0266466698145036
[7] Hastie, T., Tibshirani, R. and Friedman, J. H. (2001). The Elements of Statistical Learning : Data Mining, Inference and Prediction . Springer, New York. · Zbl 0973.62007
[8] Lévy, P. (1939). L’addition des variable aleatoires defines sur une circonference. Bull. Soc. Math. France 67 1-41. · Zbl 0023.05801
[9] Lindsay, B. G., Kettenring, J. and Siegmund, D. O. (2004). A report on the future of statistics (with discussion). Statist. Sci. 19 387-413. · Zbl 1100.62516 · doi:10.1214/088342304000000404
[10] Lindsay, B. G. and Markatou, M. (2002). Statistical Distances : A Global Framework to Inference . Springer, New York.
[11] Liu, Z. J. and Rao, C. R. (1995). Asymptotic distribution of statistics based on quadratic entropy and bootstrapping. J. Statist. Plann. Inference 43 1-18. · Zbl 0812.62016 · doi:10.1016/0378-3758(94)00005-G
[12] Satterthwaite, F. W. (1946). An approximate distribution of estimates of variance components. Biometrics Bull. 2 110-114.
[13] Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics . Wiley, New York. · Zbl 0538.62002
[14] Silverman, B. W. (1986). Density Estimation for Statistics and Data Analysis . Chapman and Hall, London. · Zbl 0617.62042
[15] Spitzner, D. J. (2006). Use of goodness-of-fit procedures in high dimensional testing. J. Statist. Comput. Simul. 76 447-457. · Zbl 1099.62044 · doi:10.1080/10629360500107907
[16] Thangavelu, S. (1993). Lectures on Hermite and Laguerre Expansions . Princeton Univ. Press. · Zbl 0791.41030
[17] Wintner, A. (1947). On the shape of the angular case of Cauchy’s distribution curves. Ann. Math. Statist. 18 589-593. · Zbl 0030.30703 · doi:10.1214/aoms/1177730351
[18] Yang, K. (2004). Using the Poisson kernel in model building and selection. Ph.D. dissertation, Pennsylvania State Univ.
[19] Yosida, K. (1980). Functional Analysis , 6th ed. Springer, Berlin. · Zbl 0435.46002
[20] Zuo, Y. and He, X. (2006). On the limiting distributions of multivariate depth-based rank sum statistics and related tests. Ann. Statist. 34 2879-2896. · Zbl 1114.62020 · doi:10.1214/009053606000000876
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.