Learning graphical structures based on Directed Acyclic Graphs (DAGs) is a challenging problem, partly owing to the large search space of possible graphs. A recent line of work formulates the structure learning problem as a continuous constrained optimization task using the least squares objective and an algebraic characterization of DAGs. However, the formulation requires a hard DAG constraint and may lead to optimization difficulties. In this paper, we study the asymptotic role of the sparsity and DAG constraints for learning DAG models in the linear Gaussian and non-Gaussian cases, and investigate their usefulness in the finite sample regime. Based on the theoretical results, we formulate a likelihood-based score function, and show that one only has to apply soft sparsity and DAG constraints to learn a DAG equivalent to the ground truth DAG. This leads to an unconstrained optimization problem that is much easier to solve. Using gradient-based optimization and GPU acceleration, our procedure can easily handle thousands of nodes while retaining a high accuracy. Extensive experiments validate the effectiveness of our proposed method and show that the DAG-penalized likelihood objective is indeed favorable over the least squares one with the hard DAG constraint.
翻译:基于定向环形图(DAGs)的学习图形结构是一个具有挑战性的问题,部分原因是可能图表的搜索空间很大。最近的一行工作将结构学习问题作为连续限制优化任务,使用最小方形目标和对DAG的代数定性,将结构学习问题作为连续限制优化任务提出。但是,这种配方需要硬的DAG限制,并可能导致优化困难。在本文中,我们研究在线性高萨和非高萨案例中学习DAG模型以及调查其在有限抽样制度中的有用性方面缺乏空间和DAG制约因素。根据理论结果,我们制定基于可能性的评分功能,并表明只需要使用软的松散和DAG限制来学习相当于地面真象DAG的DAG。这导致一个容易解决的未受限制的优化问题。使用梯度优化和GPU加速,我们的程序可以很容易地处理数千个节点,同时保留高精确度。广泛的实验可以验证我们拟议的方法的有效性,并显示DAG的偏向性目标显示DAG最强的可能性。