A review of canonical coordinates and an alternative to correspondence analysis using Hellinger distance. (English) Zbl 1167.62421

Summary: In this paper a general theory of canonical coordinates is developed for reduction of dimensionality in multivariate data, assessing the loss of information and plotting higher dimensional data in two or three dimensions for visual displays. The theory is applied to data in two way tables with variables in one category and samples (individual or populations) in the other. Two types of data are considered, one with continuous measurements on the variables and another with frequencies of attributes. An alternative to the usual correspondence analysis of contingency tables based on the Hellinger rather than the chi-square distance is suggested. The new method has some attractive features and does not suffer from some inherent drawbacks resulting from the use of the chi-square distance and variable sample sizes for the populations in the correspondence analysis. The technique of biplots where the populations and the variables are represented on the same chart is discussed


62H17 Contingency tables
62H25 Factor analysis and principal components; correspondence analysis
Full Text: EuDML Link