Causal discovery is a major task with the utmost importance for machine learning since causal structures can enable models to go beyond pure correlation-based inference and significantly boost their performance. However, finding causal structures from data poses a significant challenge both in computational effort and accuracy, let alone its impossibility without interventions in general. In this paper, we develop a meta-reinforcement learning algorithm that performs causal discovery by learning to perform interventions such that it can construct an explicit causal graph. Apart from being useful for possible downstream applications, the estimated causal graph also provides an explanation for the data-generating process. In this article, we show that our algorithm estimates a good graph compared to the SOTA approaches, even in environments whose underlying causal structure is previously unseen. Further, we make an ablation study that shows how learning interventions contribute to the overall performance of our approach. We conclude that interventions indeed help boost the performance, efficiently yielding an accurate estimate of the causal structure of a possibly unseen environment.
翻译:由于因果结构可以使模型超越纯粹基于关联的推断,大大提升其性能。然而,从数据中找到因果结构在计算努力和准确性方面都构成重大挑战,更不用说在没有一般性干预的情况下是不可能的。在本文中,我们开发了一个元强化学习算法,通过学习实施干预措施来进行因果发现,从而可以构建明确的因果图表。估计因果图除了对可能的下游应用有用外,还为数据生成过程提供了解释。在本篇文章中,我们显示我们的算法与SOTA方法相比,我们估算了一个好的图表,即使是在以前未见其根本因果结构的环境也是如此。此外,我们做了一个反差研究,表明学习干预措施如何有助于我们方法的总体绩效。我们的结论是,干预措施确实有助于提高绩效,有效地估算可能看不见的环境的因果结构。