A further understanding of cause and effect within observational data is critical across many domains, such as economics, health care, public policy, web mining, online advertising, and marketing campaigns. Although significant advances have been made to overcome the challenges in causal effect estimation with observational data, such as missing counterfactual outcomes and selection bias between treatment and control groups, the existing methods mainly focus on source-specific and stationary observational data. Such learning strategies assume that all observational data are already available during the training phase and from only one source. This practical concern of accessibility is ubiquitous in various academic and industrial applications. That's what it boiled down to: in the era of big data, we face new challenges in causal inference with observational data, i.e., the extensibility for incrementally available observational data, the adaptability for extra domain adaptation problem except for the imbalance between treatment and control groups, and the accessibility for an enormous amount of data. In this position paper, we formally define the problem of continual treatment effect estimation, describe its research challenges, and then present possible solutions to this problem. Moreover, we will discuss future research directions on this topic.
翻译:进一步理解观察数据内的原因和影响在许多领域至关重要,如经济学、保健、公共政策、网上采矿、在线广告和营销运动等。虽然在克服通过观察数据进行因果关系估计的挑战方面取得了显著进展,例如缺乏反事实结果和治疗与控制群体之间的选择偏差,但现有方法主要侧重于源源和固定观察数据。这种学习战略假定,所有观测数据在培训阶段都已经存在,而且只能从一个来源获得。这种实际的可获取性关切在各种学术和工业应用中普遍存在。这就是它归结为:在大数据时代,我们在对观测数据进行因果推断方面面临新的挑战,即:可逐步获得的观测数据的可扩展性、除治疗与控制群体之间不平衡外对额外领域适应问题的适应性,以及大量数据的可获取性。在本立场文件中,我们正式界定持续治疗效果估计问题,描述其研究挑战,然后提出这一问题的可能解决办法。此外,我们将讨论这一专题的未来研究方向。