We propose a monitoring strategy for efficient and robust estimation of disease prevalence and case numbers within closed and enumerated populations such as schools, workplaces, or retirement communities. The proposed design relies largely on voluntary testing, notoriously biased (e.g., in the case of COVID-19) due to non-representative sampling. The approach yields unbiased and comparatively precise estimates with no assumptions about factors underlying selection of individuals for voluntary testing, building on the strength of what can be a small random sampling component. This component unlocks a previously proposed "anchor stream" estimator, a well-calibrated alternative to classical capture-recapture (CRC) estimators based on two data streams. We show here that this estimator is equivalent to a direct standardization based on "capture", i.e., selection (or not) by the voluntary testing program, made possible by means of a key parameter identified by design. This equivalency simultaneously allows for novel two-stream CRC-like estimation of general means (e.g., of continuous variables such as antibody levels or biomarkers). For inference, we propose adaptations of a Bayesian credible interval when estimating case counts and bootstrapping when estimating means of continuous variables. We use simulations to demonstrate significant precision benefits relative to random sampling alone.
翻译:我们提议了一项监测战略,以便在诸如学校、工作场所或退休社区等封闭和列举的人群中高效和可靠地估计疾病流行率和病例数; 拟议的设计主要依靠自愿测试,由于非代表性抽样(如COVID-19)而明显存在偏差(如COVID-19),因此建议的设计在很大程度上依赖于自愿测试; 这种方法产生不偏不倚和相对准确的估计,对选择自愿测试的个人所依据的因素不作任何假设,而以一个小随机抽样成分的强度为基础; 这个组成部分同时可以释放先前提议的“锁定流”估计器,这是基于两个数据流的典型捕捉-捕捉(CRC)估计器(CRC)的妥善校准替代物; 我们在这里表明,这一估计器相当于基于“捕获”(即选择(或不选择)自愿测试方案)的直接标准化,这是通过设计确定的关键参数加以可能实现的。 这种等等同性同时允许对一般手段进行新型的两种流CRC估计(例如抗体水平或生物标志等连续变量)。