Multi-group data are commonly seen in practice. Such data structure consists of data from multiple groups and can be challenging to analyze due to data heterogeneity. We propose a novel Joint and Individual Component Regression (JICO) model to analyze multi-group data. In particular, our proposed model decomposes the response into shared and group-specific components, which are driven by low-rank approximations of joint and individual structures from the predictors respectively. The joint structure has the same regression coefficients across multiple groups, whereas individual structures have group-specific regression coefficients. Moreover, the choice of global and individual ranks allows our model to cover global and group-specific models as special cases. For model estimation, we formulate this framework under the representation of latent components and propose an iterative algorithm to solve for the joint and individual scores under the new representation. To construct the latent scores, we utilize the Continuum Regression (CR), which provides a unified framework that covers the Ordinary Least Squares (OLS), the Partial Least Squares (PLS), and the Principal Component Regression (PCR) as its special cases. We show that JICO attains a good balance between global and group-specific models and remains flexible due to the usage of CR. Finally, we conduct simulation studies and analysis of an Alzheimer's disease dataset to further demonstrate the effectiveness of JICO. R implementation of JICO is available online at https://github.com/peiyaow/JICO.
翻译:这种数据结构由多个组的数据组成,并且由于数据差异性而具有分析挑战性。我们提出一个新的联合和个别部分回归模型(JICO)模型,以分析多组数据。特别是,我们提议的模型将反应分解成共同和特定组的组件,分别由预测者对联合和个别结构的低调近似值驱动。联合结构在不同组别中具有相同的回归系数,而个别结构则有特定组别中的回归系数。此外,全球和个别等级的选择使我们的模型能够将全球和特定组别模型作为特例包括在内。关于模型估计,我们根据潜在组成部分的表述制定这一框架,并提出一个迭代算法,以解决在新的表述下对联合和个别分数的解决方案。为了构建潜在分数,我们利用预测者对连续递增(CRR)的缩放率,这提供了涵盖普通最低方(OLS)、部分最低方(PLS)和主构部分递增(PCRCO)的统一框架,作为特例案例。关于模型的模型的使用,我们最后显示,JICO和CR的精确度分析将持续进行全球和最新数据分析。