Complication risk profiling is a key challenge in the healthcare domain due to the complex interaction between heterogeneous entities (e.g., visit, disease, medication) in clinical data. With the availability of real-world clinical data such as electronic health records and insurance claims, many deep learning methods are proposed for complication risk profiling. However, these existing methods face two open challenges. First, data heterogeneity relates to those methods leveraging clinical data from a single view only while the data can be considered from multiple views (e.g., sequence of clinical visits, set of clinical features). Second, generalized prediction relates to most of those methods focusing on single-task learning, whereas each complication onset is predicted independently, leading to suboptimal models. We propose a multi-view multi-task network (MuViTaNet) for predicting the onset of multiple complications to tackle these issues. In particular, MuViTaNet complements patient representation by using a multi-view encoder to effectively extract information by considering clinical data as both sequences of clinical visits and sets of clinical features. In addition, it leverages additional information from both related labeled and unlabeled datasets to generate more generalized representations by using a new multi-task learning scheme for making more accurate predictions. The experimental results show that MuViTaNet outperforms existing methods for profiling the development of cardiac complications in breast cancer survivors. Furthermore, thanks to its multi-view multi-task architecture, MuViTaNet also provides an effective mechanism for interpreting its predictions in multiple perspectives, thereby helping clinicians discover the underlying mechanism triggering the onset and for making better clinical treatments in real-world scenarios.
翻译:由于临床数据中各不同实体(如访问、疾病、医药等)之间的复杂互动,并发症风险剖析是保健领域的一个关键挑战。随着提供电子健康记录和保险索赔等真实世界临床数据,为复杂风险剖析提出了许多深层次的学习方法。然而,这些现有方法面临两个公开的挑战。首先,数据差异性与从单一视角获取临床数据的方法有关,而数据只能从一个视角获取,而数据只能从多种观点(如临床访问的顺序、临床特征集)来考虑)。第二,普遍预测涉及大多数侧重于单项任务学习的方法,而每种并发症发端都是独立预测的,从而导致不完美的模型。我们提议建立一个多目多功能多功能多功能网络(MuViTaNet)网络(MuViTaNet)网络(MuviTaNet),以预测多重并发症的爆发来解决这些问题。特别是,MuviTaNet(MuviTaNet)网络(MuviTa)利用多视角编码来补充病人的代表性,将临床数据作为临床访问的序列和临床特征的发现组。此外,它利用相关标签和不固定的多功能网络的多功能的预测的预测的多功能的预测的预言,也利用从现有模型中进行更精确的预测,从而显示其现有的实验结构中更普遍的数据结构,从而显示其现有数据结构,从而显示其新的分析结果。