The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consist of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance components can be performed by various state-of-the-art machine learning methods that are available in the mlr3 ecosystem. DoubleML makes it possible to perform inference in a variety of causal models, including partially linear and interactive regression models and their extensions to instrumental variable estimation. The object-oriented implementation of DoubleML enables a high flexibility for the model specification and makes it easily extendable. This paper serves as an introduction to the double machine learning framework and the R package DoubleML. In reproducible code examples with simulated and real data sets, we demonstrate how DoubleML users can perform valid inference based on machine learning methods.
翻译:R 包“双ML ” 执行切尔诺祖科夫等人的双偏向机器学习框架(2018年),提供根据机器学习方法估计因果模型参数的功能;双机器学习框架由三个关键要素组成:内曼正方形、高质量机器学习估计和样本分离;通过在 mlr3 生态系统中现有的各种最先进的机器学习方法,可以对骚扰部分进行估计;双ML 使得有可能在各种因果模型中进行推断,包括部分线性和互动回归模型及其扩展至工具变量估计。面向对象的双ML 实施使模型规格具有高度灵活性,并使其易于扩展。本文是双机器学习框架和R包“双ML”的导言。在模拟和真实数据集的可复制代码示例中,我们演示双ML用户如何根据机器学习方法进行有效的推断。