Under two-phase designs, the outcome and several covariates and confounders are measured in the first phase, and a new predictor of interest, which may be costly to collect, can be measured on a subsample in the second phase, without incurring the costs of recruiting subjects. Information in the first phase can be used to select the second-phase subsample with the goal of improving the efficiency in testing and estimating the effect of the new predictor on the outcome. Past studies have focused on optimal two-phase sampling schemes for statistical inference on local ($\beta = o(1)$) effects of the predictor of interest. Here we extend the two-phase designs with an optimal sampling scheme in case-control studies for estimation predictor effects with pseudo conditional likelihood estimators. The proposed approach is applicable to both local and non-local effects. We demonstrate that the proposed sampling scheme can lead to a substantive improvement in the estimation of the parameter of interest simulation studies and an analysis of data from 170 patients hospitalized for COVID-19.
翻译:在两阶段设计下,在第一阶段对结果和若干共变和混凝土进行测量,在第二阶段的一个子抽样中可以对可能收集费用昂贵的新的利息预测器进行测量,而不引起招募对象的费用;第一阶段的信息可用于选择第二阶段的子抽样,目的是提高测试和估计新预测器对结果的影响的效率;过去的研究侧重于对预测者对当地($\beta=o(1)美元)影响的统计推断的两阶段最佳采样办法;我们在此将两阶段设计扩大为最佳采样办法,在估计预测结果的个案控制研究中采用最佳采样办法,并使用有条件的假冒概率估测算器;拟议的采样办法既适用于当地影响,也适用于非当地影响;我们证明拟议的采样办法可以导致对利息模拟研究参数的估计作出实质性改进,并对COVID-19170名住院病人的数据进行分析。