Pandemic management requires that scientists rapidly formulate and analyze epidemiological models in order to forecast the spread of disease and the effects of mitigation strategies. Scientists must modify existing models and create novel ones in light of new biological data and policy changes such as social distancing and vaccination. Traditional scientific modeling workflows detach the structure of a model -- its submodels and their interactions -- from its implementation in software. Consequently, incorporating local changes to model components may require global edits to the code-base through a manual, time-intensive, and error-prone process. We propose a compositional modeling framework that uses high-level algebraic structures to capture domain-specific scientific knowledge and bridge the gap between how scientists think about models and the code that implements them. These algebraic structures, grounded in applied category theory, simplify and expedite modeling tasks such as model specification, stratification, analysis, and calibration. With their structure made explicit, models also become easier to communicate, criticize, and refine in light of stakeholder feedback.
翻译:大流行病管理要求科学家迅速制定和分析流行病学模型,以预测疾病的传播和减缓战略的影响;科学家必须修改现有模型,并根据新的生物数据和政策变化,如社会偏移和疫苗接种,创建新的模型;传统的科学模型工作流程将模型的结构 -- -- 其子模型及其相互作用 -- -- 与软件应用分开,因此,将地方变化纳入模型组成部分可能需要通过人工、时间密集和容易出错的过程对代码库进行全球编辑;我们提议一个组成模型框架,利用高级代数结构捕捉特定领域的科学知识,缩小科学家如何思考模型和执行模型的守则之间的差距;这些代数结构以应用类别理论为基础,简化和加快模型任务,如模型规格、分层、分析和校准等;由于结构明确,模型也更容易根据利益攸关方的反馈进行交流、批评和完善。