近似准准准优利于差分失灵时间结果和暴露的方法 (An Approximate Quasi-Likelihood Approach for Error-Prone Failure Time Outcomes and Exposures)

from arxiv, 61 pages, 1 figure, 14 tables in total. Main manuscript: first 38 pages including references and 6 tables, followed by supplementary materials with remaining 23 pages including 1 figure and 8 tables

Measurement error arises commonly in clinical research settings that rely on data from electronic health records or large observational cohorts. In particular, self-reported outcomes are typical in cohort studies for chronic diseases such as diabetes in order to avoid the burden of expensive diagnostic tests. Dietary intake, which is also commonly collected by self-report and subject to measurement error, is a major factor linked to diabetes and other chronic diseases. These errors can bias exposure-disease associations that ultimately can mislead clinical decision-making. We have extended an existing semiparametric likelihood-based method for handling error-prone, discrete failure time outcomes to also address covariate error. We conduct an extensive numerical study to compare the proposed method to the naive approach that ignores measurement error in terms of bias and efficiency in the estimation of the regression parameter of interest. In all settings considered, the proposed method showed minimal bias and maintained coverage probability, thus outperforming the naive analysis which showed extreme bias and low coverage. This method is applied to data from the Women's Health Initiative to assess the association between energy and protein intake and the risk of incident diabetes mellitus. Our results show that correcting for errors in both the self-reported outcome and dietary exposures leads to considerably different hazard ratio estimates than those from analyses that ignore measurement error, which demonstrates the importance of correcting for both outcome and covariate error. Computational details and R code for implementing the proposed method are presented in Section S1 of the Supplementary Materials.

翻译：在依赖电子健康记录或大型观察组的数据的临床研究环境中,通常会发生测量错误;在依赖电子健康记录或大型观察组群的数据的临床研究环境中,通常会发生测量错误;特别是,在糖尿病等慢性病的群研究中,自我报告的结果是典型的,以避免昂贵的诊断测试负担的负担; 饮食摄入,也是通过自我报告收集的,通常会受到测量错误的影响,这是与糖尿病和其他慢性疾病有关的一个主要因素; 这些错误可能偏向暴露和疾病协会,最终会误导临床决策; 我们推广了一种处理易出错、离散的错误时间结果的半参数性概率法; 我们进行了广泛的数字研究,将拟议的方法与天真的方法进行比较,这种方法忽视了在估计利息回归参数时在偏差和效率方面的误差; 在考虑的所有环境中,拟议方法都表现出最小的偏差和保持了覆盖概率,从而超过了显示极端偏差和低覆盖率的天真分析; 将这一方法应用于妇女保健倡议中的数据,以评估能源和蛋白摄入和糖尿病风险之间的关联; 我们的结果显示,在进行自我报告误差的误差方面,从进行误差分析的结果显示误差的误差是误差的误差的误差,对误差的误差的误差结果的误差比误差的误差的误差是误差,结果的结果。