The choice of good distances and similarity measures between objects is important for many machine learning methods. Therefore, many metric learning algorithms have been developed in recent years, mainly for Euclidean data in order to improve performance of classification or clustering methods. However, due to difficulties in establishing computable, efficient and differentiable distances between attributed graphs, few metric learning algorithms adapted to graphs have been developed despite the strong interest of the community. In this paper, we address this issue by proposing a new Simple Graph Metric Learning - SGML - model with few trainable parameters based on Simple Graph Convolutional Neural Networks - SGCN - and elements of Optimal Transport theory. This model allows us to build an appropriate distance from a database of labeled (attributed) graphs to improve the performance of simple classification algorithms such as $k$-NN. This distance can be quickly trained while maintaining good performances as illustrated by the experimental study presented in this paper.
翻译:对于许多机器学习方法来说,选择物体之间的距离和相似度度是十分重要的。因此,近年来已经开发了许多衡量学习算法,主要是为欧几里德数据开发的,以便改进分类或集群方法的性能。然而,由于难以在可计算、高效和可区分的图表之间建立可计算、高效和可区分的距离,尽管社区非常感兴趣,但很少开发适应图表的计量算法。在本文件中,我们通过提出一个新的简单图形学习模型-SGML-模型来解决这一问题,该模型没有多少基于简单图形进化神经网络(SGCN)和最佳运输理论要素的可训练参数。这一模型使我们能够从一个有标签的(可分配的)图表数据库中建立适当的距离,以改善诸如$k$-NN的简单分类算法的性能。这一距离可以很快得到培训,同时保持本文实验性研究所显示的良好性能。