Measuring treatment effects in observational studies is challenging because of confounding bias. Confounding occurs when a variable affects both the treatment and the outcome. Traditional methods such as propensity score matching estimate treatment effects by conditioning on the confounders. Recent literature has presented new methods that use machine learning to predict the counterfactuals in observational studies which then allow for estimating treatment effects. These studies however, have been applied to real world data where the true treatment effects have not been known. This study aimed to study the effectiveness of this counterfactual prediction method by simulating two main scenarios: with and without confounding. Each type also included linear and non-linear relationships between input and output data. The key item in the simulations was that we generated known true causal effects. Linear regression, lasso regression and random forest models were used to predict the counterfactuals and treatment effects. These were compared these with the true treatment effect as well as a naive treatment effect. The results show that the most important factor in whether this machine learning method performs well, is the degree of non-linearity in the data. Surprisingly, for both non-confounding \textit{and} confounding, the machine learning models all performed well on the linear dataset. However, when non-linearity was introduced, the models performed very poorly. Therefore under the conditions of this simulation study, the machine learning method performs well under conditions of linearity, even if confounding is present, but at this stage should not be trusted when non-linearity is introduced.
翻译:在观察研究中,测量治疗效果是困难的。当变量影响治疗和结果时,就会发生混乱。传统方法,例如偏好性评分与估计治疗效果相匹配,通过对混结者进行调节。最近的文献展示了使用机器学习的新方法,以预测观察研究中的反事实,从而可以估计治疗效果。然而,这些研究应用到真实的治疗效果并不知道真实治疗效果的真实世界数据中。这一研究的目的是通过模拟两种主要情景来研究这一反事实预测方法的有效性:有的和没有混结。每种类型还包含非输入和产出数据之间的线性和非线性关系。模拟中的关键项目是我们所知道的真正因果关系。线性回归、拉索回归和随机森林模型被用来预测反事实和治疗效果。这些研究把这些与真实的治疗效果和天真的治疗效果进行了比较。结果显示,目前机器学习方法是否正确性的最重要因素是数据的非线性程度。对于非线性模型来说,在不精确性研究中进行这种模拟期间,在不精确的模型下,在进行精确性研究期间,在进行这种模拟研究时,在进行不精确性研究时,在进行这种模拟中进行这种模拟的模型的模拟中进行中进行。