Given a graph learning task, such as link prediction, on a new graph dataset, how can we automatically select the best method as well as its hyperparameters (collectively called a model)? Model selection for graph learning has been largely ad hoc. A typical approach has been to apply popular methods to new datasets, but this is often suboptimal. On the other hand, systematically comparing models on the new graph quickly becomes too costly, or even impractical. In this work, we develop the first meta-learning approach for automatic graph machine learning, called AutoGML, which utilizes the prior performances of existing methods on a wide variety of benchmark graph datasets to automatically select an effective model for the new graph, without any model training or evaluations. To capture the similarity across graphs from different domains, we introduce specialized meta-graph features that quantify the structural characteristics of a graph. Then we design a meta-graph that represents the relations among models and graphs, and develop a graph meta-learner operating on the meta-graph, which estimates the relevance of each model to different graphs. Through extensive experiments, we show that using AutoGML to select a method for the new graph significantly outperforms consistently applying popular methods as well as several existing meta-learners, while being extremely fast at test time.
翻译:考虑到图表学习任务,例如链接预测,在新的图表数据集中,我们如何自动选择最佳方法及其超参数(统称为模型)?图表学习的模型选择基本上是临时性的。典型的方法是将流行的方法应用于新的数据集,但这往往是次最佳的。另一方面,系统比较新图表中的模型很快变得过于昂贵,甚至不切实际。在这项工作中,我们开发了第一个自动图形机学习的元学习方法,称为AutoGML,它利用了在各种基准图形数据集上现有方法的先前性能,为新的图形自动选择一个有效的模型,而无需任何模式培训或评价。为了从不同领域获取相似的图形,我们引入了专门地理学特征,以量化图表的结构特征。然后我们设计了一个代表模型和图形之间关系的元图,并在元图上开发一个图表元化的元精子学习器,用来估计每个模型与不同图表的相关性。通过广泛的实验,我们展示了使用AutoGGML的当前快速测试方法,同时在快速的图表中持续地选择一种新的方法。