Compositional data analysis of geological variability and process: a case study.

*(English)*Zbl 1151.86307Summary: Developments in the statistical analysis of compositional data over the last two decades have made possible a much deeper exploration of the nature of variability and the possible processes associated with compositional data sets from many disciplines. In this paper, we concentrate on geochemical data. First, we explain how hypotheses of compositional variability may be formulated within the natural sample space, the unit simplex, including useful hypotheses of sub-compositional discrimination and specific perturbational change. Then we develop through standard methodology, such as generalised likelihood ratio tests, statistical tools to allow the systematic investigation of a lattice of such hypotheses. Some of these tests are simple adaptations of existing multivariate tests but others require special construction. We comment on the use of graphical methods in compositional data analysis and on the ordination of specimens. The recent development of the concept of compositional processes is then explained, together with the necessary tools for a staying-in-the-simplex approach, such as the singular value decomposition of a compositional data set. All these statistical techniques are illustrated for a substantial compositional data set, consisting of 209 major oxide and trace element compositions of metamorphosed limestones from the Grampian Highlands of Scotland. Finally, we discuss some unresolved problems in the statistical analysis of compositional processes.

##### MSC:

86A32 | Geostatistics |

##### Keywords:

lattice hypothesis testing; logistic discriminant analysis; metamorphosed limestones; ordination; perturbation of a composition; singular value decomposition
PDF
BibTeX
XML
Cite

\textit{C. W. Thomas} and \textit{J. Aitchison}, Math. Geol. 37, No. 7, 753--772 (2005; Zbl 1151.86307)

Full Text:
DOI

##### References:

[1] | Aitchison, J., 1981, A new approach to null correlations of proportions: Math. Geol., v. 13, no. 1, p. 48–53. |

[2] | Aitchison, J., 1982, The statistical analysis of compositional data (with discussion): J. R. Stat. Soc. Lond. Ser. B, v. 44, no. 2, p. 139–177. · Zbl 0491.62017 |

[3] | Aitchison, J., 1990, Relative variation diagrams for describing patterns of variability of compositional data: Math. Geol., v. 22, p. 487–512. · doi:10.1007/BF00890330 |

[4] | Aitchison, J., 1997, The one-hour course in compositional data analysis or compositional data analysis is easy, in Pawlowsky Glahn, V., ed., Proceedings of the Third Annual Conference of the International Association for Mathematical Geology, CIMNE, Barcelona, p. 3–35. |

[5] | Aitchison, J., 2001, Simplicial inference, in Viana, M. A. G., and Richards, D. St. P., eds., Algebraic methods in statistics and probability: Contemporary mathematics series 287: American Mathematical Society, Providence, Rhode Island, p. 1–22. · Zbl 1012.62002 |

[6] | Aitchison, J., 2003a, The statistical analysis of compositional data: (Reprint) Blackburn Press, Caldwell, New Jersey, 416 p. |

[7] | Aitchison, J., 2003b, Compositional data analysis: Where are we and where are we heading?: Keynote lecture at CoDaWork03 summarised in this special issue. |

[8] | Aitchison, J., and Barceló-Vidal, C., 2002, Compositional processes: A statistical search for understanding: Terra Nostra, special issue: Eighth Annual Conference of the International Association for Mathemaical Geology, v. 3, p. 381–386. |

[9] | Aitchison, J., Barceló-Vidal, C., Egozcue, J. J., and Pawlowsky-Glahn, V. 2002, A concise guide to the algebraic–geometric structure of the simplex, the sample space for compositional data analysis: Terra Nostra, special issue: Eighth Annual Conference of the International Association for Mathematical Geology, v. 3, p. 387–392. |

[10] | Aitchison, J., and Greenacre, M., 2002, Biplots for compositional data: Appl. Stat., v. 51, p. 375–382. · Zbl 1111.62300 |

[11] | Aitchison, J., and Thomas, C. W., 1998, Differential perturbation processes: A tool for the study of compositional processes, in Buccianti, A., Nardi, G., and Potenza, R., eds., Proceedings of the Fourth Annual Conference of the International Association for Mathematical Geology, De Frede Naplesi, p. 499–504. |

[12] | Azzalini, A., and Dalla Valle, A., 1996, The multivariate skew-normal distribution: Biometrika, v. 83, p. 715–726. · Zbl 0885.62062 · doi:10.1093/biomet/83.4.715 |

[13] | Chayes, F., 1960, On correlations between variables of constant sum: J. Geophys. Res., v. 65, p. 4185–4193. · doi:10.1029/JZ065i012p04185 |

[14] | Chayes, F., 1962, Numerical correlations and petrographic variation: Math. Geol., v. 70, p. 440–452. |

[15] | Chayes, F., 1964, Variance–covariance relations in some published Harker diagrams of volcanic suites: J. Petrol., v. 5, 219–237. |

[16] | Chayes, F., 1971, Ratio correlation: University of Chicago Press, Chicago, 99 p. |

[17] | Harris, A. L., Haselock, P. J., Kennedy, M. J., and Mendum, J R., 1994, The Dalradian Supergroup in Scotland, Shetland and Ireland, in Gibbons, W., and Harris, A. L., eds., A revised correlation of Precambrian rock in the British Isles: Geological Society of London Special Report, 22, p. 33–53. |

[18] | Gabriel, K. R., 1971, The biplot-graphic display of matrices with application to principal component analysis. Biometrika, v. 58, no. 3, p. 453–467. · Zbl 0228.62034 · doi:10.1093/biomet/58.3.453 |

[19] | Hickman, A. H., and Wright, A. E., 1983, Geochemistry and chemostratigraphical correlation of slates, marbles and quartzites of the Appin Group, Argyll: Trans. R. Soc. Edinburgh, Earth Sci., v. 73, p. 251–278. |

[20] | Mateu-Figueras, G., Barceló-Vidal, C., and Pawlowsky-Glahn, V., 1998, Modeling compositional data with multivariate skew-normal distributions, in Buccianti, A., Nardi, G., and Potenza, R., eds., Proceedings of the Fourth Annual Conference of the International Association for Mathematical Geology, De Frede, Naples, p. 532–537. |

[21] | Rock, N. M. S., 1988, Numerical geology. Lecture notes in earth sciences, 18: Springer-Verlag, Berlin, 427 p. |

[22] | Rollinson, H. R., 1993, Using geochemical data: Evaluation, presentation, interpretation: Geochemistry series: Longman, Harlow (UK), 352 p. |

[23] | Stephenson, D., and Gould, D., 1995, The Grampian Highlands, 4th Edn. British Regional Geology: Her Majesty’s Stationary Office, London, 261 p. |

[24] | Strachan, R. A., Smith, M., Harris, A. L., and Fettes, D. J., 2002, The Northern Highland and Grampian Terranes, in Trewin, N. H., ed., The geology of Scotland, 4th Edn. The Geological Society, London, p. 81–147. |

[25] | Thomas, C. W., 1989, Application of geochemistry to the stratigraphic correlation of Appin and Argyll Group carbonate rocks from the Dalradian of northeast Scotland: J. Geol. Soc., v. 146, p. 631–647. · doi:10.1144/gsjgs.146.4.0631 |

[26] | Thomas, C. W., 1993, Sources of rare earth elements in Appin Group limestones, Dalradian, north-east Scotland: Mineral. Petrol., v. 49, p. 27–44. · doi:10.1007/BF01162924 |

[27] | Thomas, C. W., 2000, The isotope geochemistry and petrology of Dalradian carbonate rocks: Ph.D. Thesis, University of Edinburgh, 510 p. |

[28] | Thomas, C. W., and Aitchison, J., 1998, The use of logratios in subcompositional analysis and geochemical discrimination of metamorphosed limestones from the northeast and central Scottish Highlands, in Buccianti, A., Nardi, G., and Potenza, R., eds., Proceedings of IAMG98, The Fourth Annual Conference of the International Association for Mathematical Geology, De Frede, Naples, p. 549–554. |

[29] | von Eynatten, H., Pawlowsky-Glahn, V., and Egozcue, J.-J., 2002, Understanding perturbation on the simplex: A simple method to better visualise and interpret compositional data in ternary diagrams: Math. Geol., v. 34, p. 249–257. · Zbl 1031.86008 · doi:10.1023/A:1014826205533 |

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.