In this paper, we expand upon and refine a monitoring strategy proposed for surveillance of diseases in finite, closed populations. This monitoring strategy consists of augmenting an arbitrarily non-representative data stream (such as a voluntary flu testing program) with a random sample (referred to as an "anchor stream"). This design allows for the use of traditional capture-recapture (CRC) estimators, as well as recently proposed anchor stream estimators that more efficiently utilize the data. Here, we focus on a particularly common situation in which the first data stream only records positive test results, while the anchor stream documents both positives and negatives. Due to the non-representative nature of the first data stream, along with the fact that inference is being performed on a finite, closed population, there are standard and non-standard finite population effects at play. Here, we propose two methods of incorporating finite population corrections (FPCs) for inference, along with an FPC-adjusted Bayesian credible interval. We compare these approaches with existing methods through simulation and demonstrate that the FPC adjustments can lead to considerable gains in precision. Finally, we provide a real data example by applying these methods to estimating the breast cancer recurrence count among Metro Atlanta-area patients in the Georgia Cancer Registry-based Cancer Recurrence Information and Surveillance Program (CRISP) database.
翻译:本文扩展并精化了一种针对有限封闭总体中疾病监测的监控策略。该策略通过将任意非代表性数据流(如自愿流感检测项目)与随机样本(称为“锚定流”)相结合来增强监测能力。这一设计使得传统捕获-再捕获(CRC)估计量以及近期提出的能更高效利用数据的锚定流估计量得以应用。本文重点关注一种特别常见的情形:第一个数据流仅记录阳性检测结果,而锚定流同时记录阳性和阴性结果。由于第一个数据流的非代表性特征,加之推断是在有限封闭总体中进行,标准与非标准的有限总体效应均发挥作用。本文提出了两种纳入有限总体校正(FPC)进行推断的方法,以及一种经FPC调整的贝叶斯可信区间。通过模拟研究,我们将这些方法与现有方法进行比较,证明FPC调整可显著提升估计精度。最后,我们通过将这些方法应用于基于佐治亚癌症登记处的癌症复发信息与监测计划(CRISP)数据库,以估算亚特兰大都会区患者中乳腺癌复发数量的实际数据案例,提供了实证分析。