One of the key objectives in many fields in machine learning is to discover causal relationships among a set of variables from observational data. In linear non-Gaussian acyclic models (LiNGAM), it can be shown that the true underlying causal structure can be identified uniquely from merely observational data. DirectLiNGAM algorithm is a well-known solution to learn the true causal structure in high dimensional setting. DirectLiNGAM algorithm executes in a sequence of iterations and it performs a set of comparisons between pairs of variables in each iteration. Unfortunately, the runtime of this algorithm grows significantly as the number of variables increases. In this paper, we propose a parallel algorithm, called ParaLiNGAM, to learn casual structures based on DirectLiNGAM algorithm. We propose a threshold mechanism that can reduce the number of comparisons remarkably compared with the sequential solution. Moreover, in order to further reduce runtime, we employ a messaging mechanism between workers and derive some mathematical formulations to simplify the execution of comparisons. We also present an implementation of ParaLiNGAM on GPU, considering hardware constraints. Experimental results on synthetic and real data show that the implementation of proposed algorithm on GPU can outperform DirectLiNGAM by a factor up to 4600 X.
翻译:机器学习的许多领域的关键目标之一是发现观察数据的一组变数之间的因果关系。 在观测数据的一组变数中,可以证明真正的内在因果结构可以仅从观测数据中确定。直接的LiNGAM算法是一个众所周知的解决方案,可以在高维环境中了解真正的因果结构。直接的LiNGAM算法按迭代顺序执行,并在每迭代中对一对变数进行一系列比较。不幸的是,这一算法的运行时间随着变数的增加而大幅增长。在本文件中,我们提议了一种平行的算法,称为ParaLiNGAM,以学习以直接的LiNGAM算法为基础的临时结构。我们建议了一个阈值机制,可以显著减少与连续解决方案相比的比较次数。此外,为了进一步减少运行时间,我们采用了工人之间的信息传递机制,并得出一些数学配方,以简化比较的执行。我们还介绍了在GPU上应用PALINGAM的运行时间,以考虑硬件限制。我们提议在GPU上采用PLI的实验性结果,通过GPO的GPOM 和真实的GPI的SA值来显示GPI的GM的SL。