In the age of big data, data integration is a critical step especially in the understanding of how diverse data types work together and work separately. Among the data integration methods, the Angle-Based Joint and Individual Variation Explained (AJIVE) is particularly attractive because it not only studies joint behavior but also individual behavior. Typically scores indicate relationships between data objects. The drivers of those relationships are determined by the loadings. A fundamental question is which loadings are statistically significant. A useful approach for assessing this is the jackstraw method. In this paper, we develop jackstraw for the loadings of the AJIVE data analysis. This provides statistical inference about the drivers in both joint and individual feature spaces.
翻译:在大数据时代,数据整合是一个关键步骤,特别是在了解不同数据类型如何相互配合和分别工作方面。在数据整合方法中,基于角度的联合和个人变异解释(AJIVE)特别具有吸引力,因为它不仅研究共同行为,而且研究个人行为。典型的分数显示数据对象之间的关系。这些关系的驱动因素由负荷决定。一个根本问题是,哪些负荷具有统计意义。评估这一方法的有用方法就是粗草法。在本文中,我们为AJIVE数据分析的加载开发了粗线。这为联合和单个特征空间的驱动因素提供了统计推论。