有条件平均治疗效果预测与线性倒退的条件性平均治疗效果预测的比重 (Benign-Overfitting in Conditional Average Treatment Effect Prediction with Linear Regression)

We study the benign overfitting theory in the prediction of the conditional average treatment effect (CATE), with linear regression models. As the development of machine learning for causal inference, a wide range of large-scale models for causality are gaining attention. One problem is that suspicions have been raised that the large-scale models are prone to overfitting to observations with sample selection, hence the large models may not be suitable for causal prediction. In this study, to resolve the suspicious, we investigate on the validity of causal inference methods for overparameterized models, by applying the recent theory of benign overfitting (Bartlett et al., 2020). Specifically, we consider samples whose distribution switches depending on an assignment rule, and study the prediction of CATE with linear models whose dimension diverges to infinity. We focus on two methods: the T-learner, which based on a difference between separately constructed estimators with each treatment group, and the inverse probability weight (IPW)-learner, which solves another regression problem approximated by a propensity score. In both methods, the estimator consists of interpolators that fit the samples perfectly. As a result, we show that the T-learner fails to achieve the consistency except the random assignment, while the IPW-learner converges the risk to zero if the propensity score is known. This difference stems from that the T-learner is unable to preserve eigenspaces of the covariances, which is necessary for benign overfitting in the overparameterized setting. Our result provides new insights into the usage of causal inference methods in the overparameterizated setting, in particular, doubly robust estimators.

翻译：我们研究了在预测有条件平均治疗效果(CATE)时的良性超称理论,并采用了线性回归模型。随着机器因果推断理论的发展,大量因果关系模型的大规模模型正在引起人们的注意。一个问题是,人们怀疑大型模型容易与抽样选择的观测相匹配,因此大型模型可能不适合因果预测。在这项研究中,为了解决可疑问题,我们通过应用最近的良性超配理论(Bartlett等人,2020年),调查过分模型的因果推断方法的有效性。具体地说,我们考虑其分布开关取决于分配规则的样本,并研究CATE的预测,而线性模型的尺寸与无限不同。我们侧重于两种方法:T-learner,它基于与每个治疗组分别构建的估算器之间的差异,以及偏差的概率(IPW)-升度,它解决了另一个近似于性分数的回归问题。在这两种方法中,Tsitemalitality-deal-depliator 的计算方法中,我们无法精确的排序,而我们无法精确的排序则显示我们的排序为直径比。

相关内容

过拟合

关注 8

过拟合，在AI领域多指机器学习得到模型太过复杂，导致在训练集上表现很好，然而在测试集上却不尽人意。过拟合（over-fitting）也称为过学习，它的直观表现是算法在训练集上表现好，但在测试集上表现不好，泛化性能差。过拟合是在模型参数拟合过程中由于训练数据包含抽样误差，在训练时复杂的模型将抽样误差也进行了拟合导致的。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日