We consider the problem of learning a graph from a finite set of noisy graph signal observations, the goal of which is to find a smooth representation of the graph signal. Such a problem is motivated by the desire to infer relational structure in large datasets and has been extensively studied in recent years. Most existing approaches focus on learning a graph on which the observed signals are smooth. However, the learned graph is prone to overfitting, as it does not take the unobserved signals into account. To address this issue, we propose a novel graph learning model based on the distributionally robust optimization methodology, which aims to identify a graph that not only provides a smooth representation of but is also robust against uncertainties in the observed signals. On the statistics side, we establish out-of-sample performance guarantees for our proposed model. On the optimization side, we show that under a mild assumption on the graph signal distribution, our proposed model admits a smooth non-convex optimization formulation. We then develop a projected gradient method to tackle this formulation and establish its convergence guarantees. Our formulation provides a new perspective on regularization in the graph learning setting. Moreover, extensive numerical experiments on both synthetic and real-world data show that our model has comparable yet more robust performance across different populations of observed signals than existing non-robust models according to various metrics.
翻译:我们认为,从一组有限的噪音图形信号观测中学习一张图表是一个问题,其目的在于找到一个平滑的图形信号表示方式。这个问题的起因是希望在大型数据集中推断关联结构,近年来已经对该问题进行了广泛研究。大多数现有方法侧重于学习一个观测到的信号均匀的图表。然而,由于所学的图表没有考虑到未观测到的信号,因此容易过度适应。为了解决这一问题,我们提议了一个基于分布式强力优化方法的新颖的图形学习模型,目的是确定一个不仅能够提供平稳的表示方式,而且能针对所观察到的信号中的不确定因素而强有力。在统计方面,我们为拟议模型建立了超出抽样性能的保证。在优化方面,我们表明,在对图形信号分布的简单假设下,我们提议的模型承认一种平稳的、没有观测到的信号优化公式。然后,我们制定了一个预测的梯度方法来处理这一表述方式,并确立其汇合保证。我们的公式为图表学习设置提供了一个新的视角。此外,对合成和现实世界各种模型中观察到的不可靠的模型进行广泛的数字实验,显示我们现有的不同模型的模型的不具有可比性。