Causal effect estimation from observational data is fundamental across various applications. However, selecting an appropriate estimator from dozens of specialized methods demands substantial manual effort and domain expertise. We present CausalPFN, a single transformer that amortizes this workflow: trained once on a large library of simulated data-generating processes that satisfy ignorability, it infers causal effects for new observational datasets out of the box. CausalPFN combines ideas from Bayesian causal inference with the large-scale training protocol of prior-fitted networks (PFNs), learning to map raw observations directly to causal effects without any task-specific adjustment. Our approach achieves superior average performance on heterogeneous and average treatment effect estimation benchmarks (IHDP, Lalonde, ACIC). Moreover, it shows competitive performance for real-world policy making on uplift modeling tasks. CausalPFN provides calibrated uncertainty estimates to support reliable decision-making based on Bayesian principles. This ready-to-use model requires no further training or tuning and takes a step toward automated causal inference (https://github.com/vdblm/CausalPFN/).
翻译:基于观测数据的因果效应估计是众多应用领域的基础任务。然而,从数十种专门方法中选择合适的估计器通常需要大量人工干预和领域专业知识。我们提出了CausalPFN,这是一种通过单一Transformer模型实现工作流摊销的方法:该模型在满足可忽略性假设的大规模模拟数据生成过程库上进行一次性训练,即可直接对新观测数据集进行因果效应推断。CausalPFN将贝叶斯因果推断思想与先验拟合网络(PFNs)的大规模训练协议相结合,学习直接将原始观测数据映射为因果效应,无需任何任务特定调整。我们的方法在异质性与平均处理效应估计基准测试(IHDP、Lalonde、ACIC)中取得了优异的平均性能。此外,在提升建模任务的实际政策制定场景中展现出竞争力。CausalPFN基于贝叶斯原理提供校准的不确定性估计,以支持可靠决策。该即用型模型无需额外训练或调参,为自动化因果推断迈出重要一步(https://github.com/vdblm/CausalPFN/)。