Estimating conditional average treatment effects (CATEs) from observational data is relevant in many fields such as personalized medicine. However, in practice, the treatment assignment is usually confounded by unobserved variables and thus introduces bias. A remedy to remove the bias is the use of instrumental variables (IVs). Such settings are widespread in medicine (e.g., trials where the treatment assignment is used as binary IV). In this paper, we propose a novel, multiply robust machine learning framework, called MRIV, for estimating CATEs using binary IVs and thus yield an unbiased CATE estimator. Different from previous work for binary IVs, our framework estimates the CATE directly via a pseudo outcome regression. (1)~We provide a theoretical analysis where we show that our framework yields multiple robust convergence rates: our CATE estimator achieves fast convergence even if several nuisance estimators converge slowly. (2)~We further show that our framework asymptotically outperforms state-of-the-art plug-in IV methods for CATE estimation, in the sense that it achieves a faster rate of convergence if the CATE is smoother than the individual outcome surfaces. (3)~We build upon our theoretical results and propose a tailored deep neural network architecture called MRIV-Net for CATE estimation using binary IVs. Across various computational experiments, we demonstrate empirically that our MRIV-Net achieves state-of-the-art performance. To the best of our knowledge, our MRIV is the first multiply robust machine learning framework tailored to estimating CATEs in the binary IV setting.
翻译:从观察数据中估算有条件平均治疗效果(CATEs)在许多领域(如个性化医学)中具有相关性。然而,在实践中,治疗任务通常由未经观察的变量混在一起,从而引入偏见。消除偏差的一种补救措施是使用工具变量(IVs)。这种设置在医学中很普遍(例如,治疗任务被用作二进制四的试验)。在本文中,我们提议了一个创新的、倍增强的机械学习框架,称为MRIV,用于使用二进制四四分制来估算CATE,从而产生公正的CATE估计器。与以往的二进四四分制工作不同,我们的框架通过假结果回归直接估算CATE。 (1) 我们提供理论分析,让我们显示我们的框架产生多重强的趋同率:我们的CATE估计值即使若干个坏点用作二进制四。 ~ 我们进一步表明,我们的框架,即现代四进制四进制四的CATE估算方法, 也意味着它能够实现一个更快速的深度的内存率。