How can we learn a dynamical system to make forecasts, when some variables are unobserved? For instance, in COVID-19, we want to forecast the number of infected and death cases but we do not know the count of susceptible and exposed people. While mechanics compartment models are widely used in epidemic modeling, data-driven models are emerging for disease forecasting. We first formalize the learning of physics-based models as AutoODE, which leverages automatic differentiation to estimate the model parameters. Through a benchmark study on COVID-19 forecasting, we notice that physics-based mechanistic models significantly outperform deep learning. Our method obtains a 57.4% reduction in mean absolute errors for 7-day ahead COVID-19 forecasting compared with the best deep learning competitor. Such performance differences highlight the generalization problem in dynamical system learning due to distribution shift. We identify two scenarios where distribution shift can occur: changes in data domain and changes in parameter domain (system dynamics). Through systematic experiments on several dynamical systems, we found that deep learning models fail to forecast well under both scenarios. While much research on distribution shift has focused on changes in the data domain, our work calls attention to rethink generalization for learning dynamical systems.
翻译:当一些变量未被观测到时,我们如何学习动态系统来作出预测?例如,在COVID-19中,我们想预测受感染和死亡病例的数量,但我们不知道受感染和受辐射人数的多少。虽然机械舱模型在流行病模型中广泛使用,但数据驱动模型正在出现用于疾病预测。我们首先将物理模型的学习正式化为AutoODE,它利用自动差异来估计模型参数。我们通过对COVID-19预测的基准研究发现,基于物理的机械模型大大超过深层学习的成绩。我们的方法在COVID-19预测之前7天的7天的绝对误差中获得了57.4%的平均值的减少,而我们不知道,我们的方法与最深的学习竞争者相比,我们不知道有多少。这种性能差异突出了动态系统学习中因分布变化而出现的普遍问题。我们先确定两种分布变化的情景:数据域的变化和参数域(系统动态动态)的变化。我们发现,深层次的学习模型在两种情景下都无法预测。虽然许多关于分配变化的研究集中在数据域的变化,但我们的工作需要重新思考整个系统。