Graphical models have long been studied in statistics as a tool for inferring conditional independence relationships among a large set of random variables. The most existing works in graphical modeling focus on the cases that the data are Gaussian or mixed and the variables are linearly dependent. In this paper, we propose a double regression method for learning graphical models under the high-dimensional nonlinear and non-Gaussian setting, and prove that the proposed method is consistent under mild conditions. The proposed method works by performing a series of nonparametric conditional independence tests. The conditioning set of each test is reduced via a double regression procedure where a model-free sure independence screening procedure or a sparse deep neural network can be employed. The numerical results indicate that the proposed method works well for high-dimensional nonlinear and non-Gaussian data.
翻译:长期以来,统计学中一直对图形模型进行研究,以此作为在大量随机变量之间推断有条件独立关系的工具,在图形模型中,最现有的工作重点是数据是高斯或混合数据,变量是线性依赖的。在本文中,我们提议了一种双回归法,用于在高维非线性和非加西语设置下学习图形模型,并证明拟议方法在温和条件下是一致的。拟议方法通过进行一系列非参数性有条件独立测试来发挥作用。通过双回归程序减少了每项测试的调制,即采用无模型的确定独立筛选程序或稀薄的深神经网络。数字结果显示,拟议方法在高维非线性和非加西语数据方面运作良好。