Identifying cause-effect relations among variables is a key step in the decision-making process. While causal inference requires randomized experiments, researchers and policymakers are increasingly using observational studies to test causal hypotheses due to the wide availability of observational data and the infeasibility of experiments. The matching method is the most used technique to make causal inference from observational data. However, the pair assignment process in one-to-one matching creates uncertainty in the inference because of different choices made by the experimenter. Recently, discrete optimization models are proposed to tackle such uncertainty. Although a robust inference is possible with discrete optimization models, they produce nonlinear problems and lack scalability. In this work, we propose greedy algorithms to solve the robust causal inference test instances from observational data with continuous outcomes. We propose a unique framework to reformulate the nonlinear binary optimization problems as feasibility problems. By leveraging the structure of the feasibility formulation, we develop greedy schemes that are efficient in solving robust test problems. In many cases, the proposed algorithms achieve global optimal solutions. We perform experiments on three real-world datasets to demonstrate the effectiveness of the proposed algorithms and compare our result with the state-of-the-art solver. Our experiments show that the proposed algorithms significantly outperform the exact method in terms of computation time while achieving the same conclusion for causal tests. Both numerical experiments and complexity analysis demonstrate that the proposed algorithms ensure the scalability required for harnessing the power of big data in the decision-making process.
翻译:确定变量之间的因果关系是决策过程中的一个关键步骤。虽然因果关系推断需要随机的实验,但研究人员和决策者越来越多地利用观察研究来测试因果假设,因为观测数据的广泛可得性和实验的不可行性。匹配方法是最用来从观测数据中作出因果推断的技术。然而,一对一对一匹配的配对分配过程由于实验者作出不同的选择而在推论中造成了不确定性。最近,为了解决这种不确定性,提出了离散的复杂度优化模型。虽然采用离散优化模型是可能的,但它们会产生非线性问题和缺乏可缩放性。在这项工作中,我们提出贪婪的算法,以解决从持续结果的观测数据中得出的因果推论。我们提出了一个独特的框架,将非线性双轨优化问题作为可行性问题重新表述。我们通过利用可行性拟定的结构,制定了在解决稳健的测试问题方面具有效率的贪婪计划。在许多情况下,拟议的算法可以确保全球最佳解决办法。我们在三个真实世界的逻辑分析中进行实验,同时将我们提出的数值的数值推算法的精确性方法展示了我们提出的精确的计算结果。