Epidemiologic and medical studies often rely on evaluators to obtain measurements of exposures or outcomes for study participants, and valid estimates of associations depends on the quality of data. Even though statistical methods have been proposed to adjust for measurement errors, they often rely on unverifiable assumptions and could lead to biased estimates if those assumptions are violated. Therefore, methods for detecting potential `outlier' evaluators are needed to improve data quality during data collection stage. In this paper, we propose a two-stage algorithm to detect `outlier' evaluators whose evaluation results tend to be higher or lower than their counterparts. In the first stage, evaluators' effects are obtained by fitting a regression model. In the second stage, hypothesis tests are performed to detect `outlier' evaluators, where we consider both the power of each hypothesis test and the false discovery rate (FDR) among all tests. We conduct an extensive simulation study to evaluate the proposed method, and illustrate the method by detecting potential `outlier' audiologists in the data collection stage for the Audiology Assessment Arm of the Conservation of Hearing Study, an epidemiologic study for examining risk factors of hearing loss in the Nurses' Health Study II. Our simulation study shows that our method not only can detect true `outlier' evaluators, but also is less likely to falsely reject true `normal' evaluators. Our two-stage `outlier' detection algorithm is a flexible approach that can effectively detect `outlier' evaluators, and thus data quality can be improved during data collection stage.
翻译:流行病学和医学研究往往依赖评价人员来获得接触量或结果的测量结果,供研究参与者使用,而且协会的有效估计取决于数据的质量。尽管提出了统计方法以适应测量错误,但统计方法往往依赖无法核实的假设,如果这些假设被违反,可能导致有偏差的估计数。因此,在数据收集阶段,需要发现潜在的`外部'评价员的方法来提高数据质量。在本文件中,我们提出一个两阶段算法,以发现`外部'评价员,其评价结果往往高于或低于同行的“外部”评价员。在第一阶段,评价员的效果是通过一个质量回归模型获得的。在第二阶段,进行假设测试是为了检测“外部”评价员,我们考虑每个假设测试的威力和所有测试中的虚假发现率。我们进行广泛的模拟研究,以评价拟议方法,说明在“视听评估”的数据收集阶段发现潜在的`外部'视听专家的方法。在“保存听力研究”的第一阶段,通过评估效果评估员的效果是,“外部”分析员的效果是“外部”的诊断性研究,因此,在“模拟阶段,我们的诊断损失评估方法中,只能进行真正的研究,只有“实地研究,我们的健康评估方法”能够进行。