In many domains, including healthcare, biology, and climate science, time series are irregularly sampled with variable time between successive observations and different subsets of variables (sensors) are observed at different time points, even after alignment to start events. These data create multiple challenges for prevailing models that assume fully observed and fixed-length feature representations. To address these challenges, it is essential to understand the relationships between sensors and how they evolve over time. Here, we introduce RAINDROP, a graph-guided network for learning representations of irregularly sampled multivariate time series. RAINDROP represents every sample as a graph, where nodes indicate sensors and edges represent dependencies between them. RAINDROP models dependencies between sensors using neural message passing and temporal self-attention. It considers both inter-sensor relationships shared across samples and those unique to each sample that can vary with time, and it adaptively estimates misaligned observations based on nearby observations. We use RAINDROP to classify time series and interpret temporal dynamics of three healthcare and human activity datasets. RAINDROP outperforms state-of-the-art methods by up to 11.4% (absolute points in F1 score), including methods that deal with irregular sampling using fixed discretization and set functions, and even in challenging leave-sensor-out settings and setups that require generalizing to new patient groups.
翻译:在许多领域,包括保健、生物学和气候科学,时间序列不定期抽样,在相继观测和不同变量子集(传感器)之间不同时间点观察到不同时间点之间的不同时间点,即使在对齐之后,也观察到不同时间点(传感器),这些数据对具有完全观察和固定长度特征的当前模型构成多重挑战。为了应对这些挑战,必须了解传感器之间的关系以及它们随时间变化而变化的方式。这里,我们引入一个图表指导网络,即REANDROP,用于学习非常规抽样多变时间序列的演示。RAINDROP代表每个样本,作为图表,其中节点显示传感器和边缘代表它们之间的依赖性。RAINDROP模型在使用神经信息传递和时间间隔自我注意的传感器之间产生多种依赖性关系。它考虑到各样本之间共有的关系以及每个样本之间独特的关系,以及它们随时间变化而变化的方式。在这里,我们采用REANDROPOP, 来分类时间序列,并解释三种医疗保健和人类活动数据集的时间动态。REANDROPS 超越传感器的状态和边缘结构, 包括固定的定序法和分级的分级法,要求采用11级方法。