Graph neural networks, as powerful deep learning tools to model graph-structured data, have demonstrated remarkable performance on numerous graph learning tasks. To counter the data noise and data scarcity issues in deep graph learning (DGL), increasing graph data augmentation research has been conducted lately. However, conventional data augmentation methods can hardly handle graph-structured data which is defined on non-Euclidean space with multi-modality. In this survey, we formally formulate the problem of graph data augmentation and further review the representative techniques in this field. Specifically, we first propose a taxonomy for graph data augmentation and then provide a structured review by categorizing the related work based on the augmented information modalities. Focusing on the two challenging problems in DGL (i.e., optimal graph learning and low-resource graph learning), we also discuss and review the existing learning paradigms which are based on graph data augmentation. Finally, we point out a few directions and challenges on promising future works.
翻译:作为模拟图形结构数据的强大深层学习工具,图表神经网络在众多图表学习任务方面表现出了显著的成绩。为了应对深图学习(DGL)中的数据噪音和数据稀缺问题,最近开展了越来越多的图形数据增强研究。然而,常规数据增强方法很难处理非欧裔多模式空间界定的图形结构数据。在这次调查中,我们正式提出了图形数据增强问题,并进一步审查了该领域的代表性技术。具体地说,我们首先提出了图表数据增强的分类法,然后根据扩大的信息模式对相关工作进行分类,从而提供了结构化审查。我们侧重于DGL的两个挑战性问题(即最佳图形学习和低资源图学习),我们还讨论和审查了基于图形数据增强的现有学习模式。最后,我们指出了关于有前途的未来工作的几个方向和挑战。