CSCI: 概率时间序列估计的有条件计分模型 (CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation)

The imputation of missing values in time series has many applications in healthcare and finance. While autoregressive models are natural candidates for time series imputation, score-based diffusion models have recently outperformed existing counterparts including autoregressive models in many tasks such as image generation and audio synthesis, and would be promising for time series imputation. In this paper, we propose Conditional Score-based Diffusion models for Imputation (CSDI), a novel time series imputation method that utilizes score-based diffusion models conditioned on observed data. Unlike existing score-based approaches, the conditional diffusion model is explicitly trained for imputation and can exploit correlations between observed values. On healthcare and environmental data, CSDI improves by 40-65% over existing probabilistic imputation methods on popular performance metrics. In addition, deterministic imputation by CSDI reduces the error by 5-20% compared to the state-of-the-art deterministic imputation methods. Furthermore, CSDI can also be applied to time series interpolation and probabilistic forecasting, and is competitive with existing baselines. The code is available at https://github.com/ermongroup/CSDI.

翻译：时间序列中缺失值的估算在医疗保健和融资方面有许多应用。虽然自动递减模型是时间序列估算的自然候选物,但基于分数的传播模型最近优于现有模型,包括图像生成和音频合成等许多任务中的自动递减模型,而且对时间序列估算很有希望。在本文中,我们提出了基于条件的分数推算模型(CSDI),这是使用以观察数据为条件的基于分数的传播模型的新型时间序列估算方法。与现有的基于分数的方法不同,有条件的推广模型是明确的估算方法,可以利用观察到的值之间的关联。在保健和环境数据方面,CISDI比现有的流行性绩效指标的概率推算方法提高了40-65%。此外,CSMI的确定性估算方法比以观察数据为条件的推算方法减少了5-20%。此外,CSISI还可以在时间序列内断分数和预测/预测组之间应用有条件的推广模型。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/