A class of multivariate distribution-free tests of independence based on graphs.

*(English)* Zbl 1348.62162

Summary: A class of distribution-free tests is proposed for the independence of two subsets of response coordinates. The tests are based on the pairwise distances across subjects within each subset of the response. A complete graph is induced by each subset of response coordinates, with the sample points as nodes and the pairwise distances as the edge weights. The proposed test statistic depends only on the rank order of edges in these complete graphs. The response vector may be of any dimension. In particular, the number of samples may be smaller than the dimension of the response. The test statistic is shown to have a normal limiting distribution with known expectation and variance under the null hypothesis of independence. The exact distribution-free null distribution of the test statistic is given for a sample of size 14, and its Monte Carlo approximation is considered for larger sample sizes. We demonstrate in simulations that this new class of tests has good power properties for very general alternatives.
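The construction described in the summary (pairwise distances within each subset of coordinates, a complete graph per subset, and a statistic that uses only the rank order of the edges) can be illustrated with a minimal sketch. The summary does not state the paper's actual statistic, so everything below is an assumption made for illustration: the function names `pairwise_distances`, `within_row_ranks`, `rank_statistic` and `monte_carlo_pvalue` are hypothetical, the statistic (the average, over nodes, of the Spearman correlation between the two rankings of the remaining points by distance) is a stand-in, and the null distribution is approximated by permuting the rows of one subset rather than by the exact or limiting distribution mentioned above.

```python
import numpy as np


def pairwise_distances(a):
    """Euclidean distance matrix: edge weights of the complete graph on the rows of a."""
    diff = a[:, None, :] - a[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def within_row_ranks(d):
    """For each node, rank the other n-1 nodes by distance (0 = nearest)."""
    n = d.shape[0]
    # Drop the diagonal (self-distances), keeping n-1 edges per node.
    off = d[~np.eye(n, dtype=bool)].reshape(n, n - 1)
    # argsort of argsort yields ranks; ties are broken by position.
    return off.argsort(axis=1).argsort(axis=1)


def rank_statistic(x, y):
    """Illustrative statistic (an assumption, not the paper's): mean per-node
    Spearman correlation between the X-distance and Y-distance rankings.
    Requires at least 3 sample points."""
    rx = within_row_ranks(pairwise_distances(x)).astype(float)
    ry = within_row_ranks(pairwise_distances(y)).astype(float)
    rx -= rx.mean(axis=1, keepdims=True)
    ry -= ry.mean(axis=1, keepdims=True)
    num = (rx * ry).sum(axis=1)
    den = np.sqrt((rx ** 2).sum(axis=1) * (ry ** 2).sum(axis=1))
    return float((num / den).mean())


def monte_carlo_pvalue(x, y, n_perm=199, seed=0):
    """One-sided Monte Carlo p-value obtained by permuting the rows of y."""
    rng = np.random.default_rng(seed)
    observed = rank_statistic(x, y)
    exceed = sum(
        rank_statistic(x, y[rng.permutation(len(y))]) >= observed
        for _ in range(n_perm)
    )
    return (1 + exceed) / (1 + n_perm)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # 12 samples of a 20-dimensional subset: fewer samples than dimensions.
    x = rng.normal(size=(12, 20))
    y_dep = x[:, :5] ** 2 + 0.1 * rng.normal(size=(12, 5))  # depends on x
    y_ind = rng.normal(size=(12, 5))                        # independent of x
    print("dependent:  ", monte_carlo_pvalue(x, y_dep))
    print("independent:", monte_carlo_pvalue(x, y_ind))
```

Because only ranks of distances enter the computation, the sketch works unchanged when the dimension of either subset exceeds the sample size, and rescaling either subset by a positive constant leaves the statistic unchanged.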

##### MSC:

62G10 | Nonparametric hypothesis testing |

62H20 | Measures of association (correlation, canonical correlation, etc.) |

05C85 | Graph algorithms (graph-theoretic aspects) |


R. Heller et al., J. Stat. Plann. Inference 142, No. 12, 3097-3106 (2012; Zbl 1348.62162)

