Recovering underlying Directed Acyclic Graph (DAG) structures from observational data is highly challenging due to the combinatorial nature of the DAG-constrained optimization problem. Recently, DAG learning has been cast as a continuous optimization problem by characterizing the DAG constraint as a smooth equality one, generally based on polynomials over adjacency matrices. Existing methods place very small coefficients on high-order polynomial terms for stabilization, since they argue that large coefficients on the higher-order terms are harmful due to numeric exploding. On the contrary, we discover that large coefficients on higher-order terms are beneficial for DAG learning, when the spectral radiuses of the adjacency matrices are small, and that larger coefficients for higher-order terms can approximate the DAG constraints much better than the small counterparts. Based on this, we propose a novel DAG learning method with efficient truncated matrix power iteration to approximate geometric series based DAG constraints. Empirically, our DAG learning method outperforms the previous state-of-the-arts in various settings, often by a factor of $3$ or more in terms of structural Hamming distance.
翻译:由于DAG受限制的优化问题的组合性质,从观测数据中回收直接环形图(DAG)结构的根基是极具挑战性的。最近,DAG学习被定性为连续优化问题,因为将DAG限制定性为光滑的平等问题,一般基于对相邻基体的多元基体。现有方法将非常小的系数置于高阶多式多式稳定条件上,因为它们认为,较高级条件的较大系数对数字爆炸有害。相反,我们发现,高级条件的较大系数有利于DAG学习,当相邻基体的光谱半径很小时,DAG学习是一个连续优化问题,而较高等级条件的系数可能比小对等基体更接近DAG限制。在此基础上,我们建议一种新型的DAG学习方法,具有高效的松动矩阵功率,用于估计基于DAG的限制的几何序列。我们DAG的学习方法往往比以前在不同环境中的状态,或更远端结构值高。