Modeling and Forecasting COVID-19 Cases using Latent Subpopulations

Classical epidemiological models assume homogeneous populations. Important extensions exist for heterogeneous populations when the identity of each sub-population is known, such as age group or geographical location. Here, we propose two new methods to model the number of people infected with COVID-19 over time, each as a linear combination of latent sub-populations; that is, we do not know which person belongs to which sub-population, and the only available observations are the aggregates across all sub-populations. Method #1 is a dictionary-based approach, which begins with a large number of pre-defined sub-population models (each with its own starting time, shape, etc.), then determines the (positive) weights of a small (learned) number of sub-populations. Method #2 is a mixture of $M$ fittable curves, where $M$, the number of sub-populations to use, is given by the user. Both methods are compatible with any parametric model; here we demonstrate their use with (a) Gaussian curves and (b) SIR trajectories. We empirically evaluate the proposed methods, first in (i) modeling the observed data and then in (ii) forecasting the number of infected people 1 to 4 weeks in advance. Across 187 countries, the dictionary approach had the lowest mean absolute percentage error and the lowest variance when compared with classical SIR models; moreover, it was a strong baseline that outperformed many of the models developed for COVID-19 forecasting.
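
The dictionary idea in Method #1 can be illustrated with a small sketch. It assumes Gaussian curves as the pre-defined sub-population models and uses non-negative least squares (`scipy.optimize.nnls`) to find the positive weights. The abstract does not say which solver or sparsity mechanism the authors use, so NNLS is only a stand-in here, and the function names, parameter grid, and synthetic data are all hypothetical.

```python
import numpy as np
from scipy.optimize import nnls


def gaussian_curve(t, center, width):
    """Unit-height Gaussian bump standing in for one sub-population's case curve."""
    return np.exp(-0.5 * ((t - center) / width) ** 2)


def build_dictionary(t, centers, widths):
    """Return a matrix with one column per (center, width) pair, plus the pairs."""
    params = [(c, w) for c in centers for w in widths]
    D = np.column_stack([gaussian_curve(t, c, w) for c, w in params])
    return D, params


def fit_dictionary(y, D):
    """Solve min ||D w - y||_2 subject to w >= 0 (assumed solver, not the paper's)."""
    weights, residual = nnls(D, y)
    return weights, residual


# Synthetic aggregate: two hidden sub-populations, observed only as their sum.
t = np.arange(120, dtype=float)
y = 500.0 * gaussian_curve(t, 30.0, 5.0) + 300.0 * gaussian_curve(t, 90.0, 10.0)

# Illustrative dictionary: 6 candidate peak days x 2 candidate widths = 12 curves.
D, params = build_dictionary(t, centers=np.arange(10, 120, 20), widths=[5.0, 10.0])
weights, residual = fit_dictionary(y, D)

# Report the sub-population curves that received a non-negligible weight.
for i in np.flatnonzero(weights > 1e-6):
    print(f"center={params[i][0]}, width={params[i][1]}, weight={weights[i]:.1f}")
```

Because the weights are constrained to be non-negative, most dictionary entries typically end up with zero weight, which mirrors the "small (learned) number of sub-populations" described above. A forecast would evaluate the selected curves at future time points and sum them with the same weights.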


