We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available. The outcome of interest is limited in the sense that it is only observed in the primary sample. In reality, such multiple data sources may belong to different populations and thus cannot be combined directly. This paper proposes a novel calibrated optimal decision rule (CODR) to address the limited outcome, by leveraging the shared pattern in multiple data sources. Under a mild and testable assumption that the conditional means of intermediate outcomes in different samples are equal given baseline covariates and the treatment information, we can show that the calibrated mean outcome of interest under the CODR is unbiased and more efficient than using the primary sample solely. Extensive experiments on simulated datasets demonstrate empirical validity and improvement of the proposed CODR, followed by a real application on the MIMIC-III as the primary sample with auxiliary data from eICU.
翻译:我们从现有多种辅助来源的原始利益抽样中考虑最佳决策问题,其结果是有限的,因为它只在原始抽样中观察到。在现实中,这种多数据来源可能属于不同的人口,因此不能直接合并。本文件建议采用新的经校准的最佳决策规则(CODR),通过在多个数据来源中利用共享模式,解决有限的结果。根据不同样品中附带条件的中间结果手段等于给定基准变量和处理信息这一可测试的轻度假设,我们可以表明,CODR下经校准的平均利益结果没有偏见,而且比仅使用原始样本更有效。关于模拟数据集的广泛试验表明拟议的CODR的经验有效性和改进,随后对MIMIC-III作为主要样本的MIC-III进行了实际应用,并附有来自eICU的辅助数据。