We consider outbreak detection settings of endemic diseases where the population under study consists of various subpopulations available for stratified surveillance. These subpopulations can for example be based on age cohorts, but may also correspond to other subgroups of the population under study such as international travellers. Rather than sampling uniformly across the population, one may elevate the effectiveness of the detection methodology by optimally choosing a sampling subpopulation. We show (under some assumptions) the relative sampling efficiency between two subpopulations is inversely proportional to the ratio of their respective baseline disease risks. This implies one can increase sampling efficiency by sampling from the subpopulation with higher baseline disease risk. Our results require careful treatment of the power curves of exact binomial tests as a function of their sample size, which are non-monotonic due to the underlying discreteness. A case study of COVID-19 cases in the Netherlands illustrates our theoretical findings.
翻译:我们考虑地方性疾病的暴发检测场景,其中研究人群包含多个可用于分层监测的亚群。这些亚群可以基于年龄队列划分,也可能对应研究人群中的其他子组,例如国际旅行者。相较于在整体人群中均匀抽样,通过优化选择抽样亚群可提升检测方法的有效性。我们证明(在某些假设下),两个亚群间的相对抽样效率与其各自基线疾病风险的比值成反比。这意味着通过从基线疾病风险较高的亚群中抽样,可以提高抽样效率。我们的结果需要精确处理二项检验的功效曲线随样本量变化的特性——由于基础离散性,这些曲线具有非单调性。一项针对荷兰COVID-19病例的案例研究验证了我们的理论发现。