Compositional data analysis is concerned with multivariate data that have a constant sum, usually 1 or 100\%. These are data often found in biochemistry and geochemistry, but also in the social sciences, when relative values are of interest rather than the raw values. Recent applications are in the area of very high-dimensional "omics" data. Logratios are frequently used for this type of data, i.e. the logarithms of ratios of the components of the data vectors. These ratios raise interesting issues in matrix-vector representation, computation and interpretation, which will be dealt with in this chapter.
翻译:构成数据分析涉及具有恒定和的多变量数据,通常为1或100 ⁇ 。这些数据通常见于生物化学和地球化学中,但也见于社会科学中,因为相对值是有意义的,而不是原始值。最近的应用涉及非常高维的“组群”数据领域。这类数据经常使用Logratios,即数据矢量各组成部分比率的对数。这些比率在矩阵-矢量的表述、计算和解释中提出了有趣的问题,本章将讨论这些问题。