将生存数据纳入事件和常见案件的个案控制研究 (Incorporating survival data into case-control studies with incident and prevalent cases)

Typically, case-control studies to estimate odds-ratios associating risk factors with disease incidence from logistic regression only include cases with newly diagnosed disease. Recently proposed methods allow incorporating information on prevalent cases, individuals who survived from disease diagnosis to sampling, into cross-sectionally sampled case-control studies under parametric assumptions for the survival time after diagnosis. Here we propose and study methods to additionally use prospectively observed survival times from prevalent and incident cases to adjust logistic models for the time between disease diagnosis and sampling, the backward time, for prevalent cases. This adjustment yields unbiased odds-ratio estimates from case-control studies that include prevalent cases. We propose a computationally simple two-step generalized method-of-moments estimation procedure. First, we estimate the survival distribution based on a semi-parametric Cox model using an expectation-maximization algorithm that yields fully efficient estimates and accommodates left truncation for the prevalent cases and right censoring. Then, we use the estimated survival distribution in an extension of the logistic model to three groups (controls, incident and prevalent cases), to accommodate the survival bias in prevalent cases. In simulations, when the amount of censoring was modest, odds-ratios from the two-step procedure were equally efficient as those estimated by jointly optimizing the logistic and survival data likelihoods under parametric assumptions. Even with 90% censoring they were as efficient as estimates obtained using only cross-sectionally available information under parametric assumptions. This indicates that utilizing prospective survival data from the cases lessens model dependency and improves precision of association estimates for case-control studies with prevalent cases.

翻译：通常情况下,通过病例控制研究来估计从后勤回归中将风险因素与疾病发病率相联系的概率差值,仅包括新诊断的疾病病例。最近建议的方法允许将有关流行病例的信息,即从疾病诊断到取样的存活者,纳入诊断后存活时间的参数假设下的跨部门抽样病例控制研究中。我们在这里提出并研究各种方法,以便进一步利用从流行和事故案例到潜在观察的存活时间,从普遍案例的诊断和取样到落后时间,以调整常见案例之间的后勤模型。这种调整从包括流行案例的病例在内的病例控制研究中得出不偏不倚的概率差值差值差值估计。我们建议采用一种简单、两步通用方法估算程序,即从疾病诊断到取样,将一般病例的精确率分析纳入到抽样分析中。首先,我们根据半参数Cox模型评估生存分布情况,使用完全高效的估计数,并满足常见案例和正常检查时间差值检查;然后,我们使用估计的存活模式,将估计模式扩大到三个群体(控制、事件和常见案例),以适应常见案例为生存偏差值。在模拟中,通过共同检验的假设,根据共同评估的概率假设,这种估计,从正常数据推算算算的概率假设为平均的概率假设的概率差值假设,这些假设,从可算算算为平均的概率差差值假设,这些假设,从正常的概率推算。

相关内容

CASES

关注 4

CASES：International Conference on Compilers, Architectures, and Synthesis for Embedded Systems。 Explanation：嵌入式系统编译器、体系结构和综合国际会议。 Publisher：ACM。 SIT： http://dblp.uni-trier.de/db/conf/cases/index.html

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【MLSS2020】流数据贝叶斯预测，米兰Sonia Petrone教授，80页ppt

专知会员服务

48+阅读 · 2020年7月5日

因果图，Causal Graphs，52页ppt

专知会员服务

252+阅读 · 2020年4月19日