×

zbMATH — the first resource for mathematics

Testing for homogeneity of multivariate dispersions using dissimilarity measures. (English) Zbl 1270.62078
Summary: Testing homogeneity of dispersions may be of its own scientific interest as well as an important auxiliary step verifying assumptions of a main analysis. The problem is that many biological and ecological data are highly skewed and zero-inflated. Also the number of variables often exceeds the sample size. Thus data analysts often do not rely on parametric assumptions, but use a particular dissimilarity measure to calculate a matrix of pairwise differences. This matrix is then the basis for further statistical inference. M.J. Anderson [Biometrics 62, No. 1, 245–253 (2006; Zbl 1091.62049)] proposed a distance-based test of homogeneity of multivariate dispersions for a one-way ANOVA design, for which a matrix of pairwise dissimilarities can be taken as an input. The key idea, like in Levene’s test, is to replace each observation with its distance to an estimated group center. We suggest an alternative approach that is based on the means of within-group distances and does not require group centre calculations to obtain the test statistic. We show that this approach can have theoretical as well as practical advantages. A permutation procedure that gives type I error close to the prescribed value even in small samples is described.

MSC:
62G10 Nonparametric hypothesis testing
62J10 Analysis of variance and covariance (ANOVA)
Software:
mvabund; sedaR
PDF BibTeX XML Cite
Full Text: DOI
References:
[1] Aitchison, The multivariate Poisson-log normal distribution, Biometrika 76 pp 643– (1989) · Zbl 0679.62040 · doi:10.1093/biomet/76.4.643
[2] Anderson, A new method for non-parametric multivariate analysis of variance, Austral Ecology 26 pp 32– (2001)
[3] Anderson, Distance-based tests for homogeneity of multivariate dispersions, Biometrics 62 pp 245– (2006) · Zbl 1091.62049 · doi:10.1111/j.1541-0420.2005.00440.x
[4] Anderson, Navigating the multiple meanings of \(\beta\) diversity: A roadmap for the practicing ecologist, Ecology Letters 14 pp 19– (2011) · doi:10.1111/j.1461-0248.2010.01552.x
[5] Anderson, Multivariate dispersion as a measure of beta diversity, Ecology Letters 9 pp 683– (2006) · doi:10.1111/j.1461-0248.2006.00926.x
[6] Anderson, Spatial variation and effects of habitat on temperate reef fish assemblages in northeastern New Zealand, Journal of Experimental Marine Biology and Ecology 305 pp 191– (2004) · doi:10.1016/j.jembe.2003.12.011
[7] Bumpus, The elimination of the unfit as illustrated by the introduced sparrow, passer domesticus pp 209– (1899)
[8] Cailliez, The analytical solution of the additive constant problem, Psychometrika 48 pp 305– (1983) · Zbl 0534.62079 · doi:10.1007/BF02294026
[9] Callaert, The order of the normal approximation for a studentized U-statistics, Annals of Statistics 9 pp 194– (1981) · Zbl 0457.62018 · doi:10.1214/aos/1176345347
[10] Campbell, Biology (2008)
[11] Clarke, Non-parametric multivariate analysis of changes in community structure, Australian Journal of Ecology 18 pp 117– (1993) · doi:10.1111/j.1442-9993.1993.tb00438.x
[12] Excoffier, Analysis of molecular variance inferred from metric distances among DNA haplotypes: Application to human mitochondrial DNA restriction data, Genetics 131 pp 479– (1992)
[13] Gower, Analysis of distance for structured multivariate data and extensions to multivariate analysis of variance, Journal of the Royal Statistical Society. Series C. 48 pp 505– (1999) · Zbl 0956.62048 · doi:10.1111/1467-9876.00168
[14] Haldane, Note on the median of a multivariate distribution, Biometrika 35 pp 414– (1948) · Zbl 0032.03601 · doi:10.1093/biomet/35.3-4.414
[15] Legendre, Distance-based redundancy analysis: Testing multispecies responses in multifactorial ecological experiments, Ecological Monographs 69 pp 1– (1999) · doi:10.1890/0012-9615(1999)069[0001:DBRATM]2.0.CO;2
[16] Legendre, Ecologically meaningful transformations for ordination of species data, Oecologia 129 pp 271– (2001) · doi:10.1007/s004420100716
[17] Legendre, Numerical Ecology (1998)
[18] Manly, Multivariate statistical methods. A Primer (2005)
[19] Mantel, A technique of nonparametric multivariate analysis, Biometrics 26 pp 547– (1970) · doi:10.2307/2529108
[20] McArdle, Fitting multivariate models to community data: A comment on distance-based redundancy analysis, Ecology 82 pp 290– (2001) · doi:10.1890/0012-9658(2001)082[0290:FMMTCD]2.0.CO;2
[21] Mielke, A class of nonparametric tests based on multiresponse permutation procedures, Biometrika 68 pp 720– (1981) · doi:10.1093/biomet/68.3.720
[22] O’Brien, Robust procedures for testing equality of covariance matrices, Biometrics 48 pp 819– (1992) · doi:10.2307/2532347
[23] Pillar, On randomization testing in vegetation science: Multifactor comparisons of relevé groups, Journal of Vegetation Science 7 pp 585– (1996) · doi:10.2307/3236308
[24] Rencher, Wiley Series in Probability and Statistics (1998)
[25] Serfling, Approximation Theorems of Mathematical Statistics (1980) · Zbl 0538.62002 · doi:10.1002/9780470316481
[26] Smith, Community similarity and the analysis of multispecies environmental data: A unified statistical approach, Water Research. 24 pp 507– (1990) · doi:10.1016/0043-1354(90)90236-Y
[27] Tiku, Testing the equality of variance-covariance matrices the robust way, Communication in Statistics - Theory and Methods 14 pp 3033– (1985) · Zbl 0588.62089 · doi:10.1080/03610928508829093
[28] Van Valen, The statistics of variation, Evolutionary Theory 4 pp 33– (1978)
[29] Vellend, Homogenization of forest plant communities and weakening of species-environment relationships via agricultural land use, Journal of Ecology 95 pp 565– (2007) · doi:10.1111/j.1365-2745.2007.01233.x
[30] Volaufová, Heteroscedastic ANOVA: old p values, new views, Statistical Papers 50 pp 943– (2009) · Zbl 1247.62187 · doi:10.1007/s00362-009-0262-4
[31] Wang, mvabund- an R package for model-based analysis of multivariate abundance data, Methods in Ecology and Evolution 8 pp 471– (2012) · doi:10.1111/j.2041-210X.2012.00190.x
[32] Warton, Distance-based multivariate analyses confound location and dispersion effects, Methods in Ecology and Evolution 3 pp 89– (2012) · doi:10.1111/j.2041-210X.2011.00127.x
[33] Warwick, A statistical analysis of coral community responses to the 1982-83 El Niňo in the Thousand Islands, Indonesia, Coral Reefs 8 pp 171– (1990) · doi:10.1007/BF00265008
[34] Whittaker, Vegetation of the Siskiyou Mountains, Oregon and California, Ecological Monographs 30 pp 279– (1960) · doi:10.2307/1943563
[35] Whittaker, Evolution and measurement of species diversity, Taxon 21 pp 213– (1972) · doi:10.2307/1218190
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.