Independent component analysis (ICA) is a blind source separation method for linear disentanglement of independent latent sources from observed data. We investigate the special setting of noisy linear ICA where the observations are split among different views, each receiving a mixture of shared and individual sources. We prove that the corresponding linear structure is identifiable, and the source distribution can be recovered. To computationally estimate the sources, we optimize a constrained form of the joint log-likelihood of the observed data among all views. We also show empirically that our objective recovers the sources also in the case when the measurements are corrupted by noise. Furthermore, we propose a model selection procedure for recovering the number of shared sources which we verify empirically. Finally, we apply the proposed model in a challenging real-life application, where the estimated shared sources from two large transcriptome datasets (observed data) provided by two different labs (two different views) lead to recovering (shared) sources utilized for finding a plausible representation of the underlying graph structure.
翻译:独立元件分析(ICA) 是一种盲源分离方法,用于将独立潜在来源从观察到的数据中直线分解出来。我们调查噪音线性ICA的特殊设置,即将观测结果分成不同观点,每个观点得到共享和单个来源的混合体。我们证明相应的线性结构是可识别的,源分布情况是可以回收的。为了对来源进行计算估计,我们优化了所有观点之间共同日志相似性的一种限制形式。我们还从经验上表明,在测量结果被噪音破坏的情况下,我们的目标也恢复了源。此外,我们提议了一个模式选择程序,以恢复共享来源的数量,我们从经验上核查。最后,我们应用了拟议模型,在具有挑战性的实际应用中,从两个不同的实验室(两种不同观点)提供的两大笔录数据集(所观察的数据)中估计的共享来源(已知数据)导致回收(共享)用于寻找基本图表结构合理代表性的来源。</s>