Integration of data from multiple omics techniques is becoming increasingly important in biomedical research. Due to non-uniformity and technical limitations in omics platforms, such integrative analyses on multiple omics, which we refer to as views, involve learning from incomplete observations with various view-missing patterns. This is challenging because i) complex interactions within and across observed views need to be properly addressed for optimal predictive power and ii) observations with various view-missing patterns need to be flexibly integrated. To address such challenges, we propose a deep variational information bottleneck (IB) approach for incomplete multi-view observations. Our method applies the IB framework on marginal and joint representations of the observed views to focus on intra-view and inter-view interactions that are relevant for the target. Most importantly, by modeling the joint representations as a product of marginal representations, we can efficiently learn from observed views with various view-missing patterns. Experiments on real-world datasets show that our method consistently achieves gain from data integration and outperforms state-of-the-art benchmarks.
翻译:在生物医学研究中,来自多种显微镜技术的数据的整合正变得越来越重要。由于显微镜平台的不统一和技术限制,对多种显微镜的这种综合分析(我们称之为观点)涉及从不完全的观察中学习,而这种综合分析则涉及从各种视觉偏差模式中学习,这具有挑战性,因为为了最佳预测力,需要适当处理所观测到的多种视觉技术内部和不同观点之间的复杂互动,以及(二)各种视觉偏差模式的观测需要灵活整合。为了应对这些挑战,我们建议对不完整的多视图观测采用深度的变异信息瓶颈(IB)方法。我们的方法是将所观测到的观点的边际和联合表达框架用于侧重于与目标相关的视觉内和视觉间互动。最重要的是,通过将联合表述作为边际表达的产物,我们可以高效率地从各种视觉偏差模式的观察观点中学习。对现实世界数据集的实验表明,我们的方法始终从数据整合和超常规基准中得益。