Multimodal machine learning has been widely studied for the development of general intelligence. Recently, the remarkable multimodal algorithms, the Perceiver and Perceiver IO, show competitive results for diverse dataset domains and tasks. However, recent works, Perceiver and Perceiver IO, have focused on heterogeneous modalities, including image, text, and speech, and there are few research works for graph structured datasets. A graph is one of the most generalized dataset structures, and we can represent the other dataset, including images, text, and speech, as graph structured data. A graph has an adjacency matrix different from other dataset domains such as text and image, and it is not trivial to handle the topological information, relational information, and canonical positional information. In this study, we provide a Graph Perceiver IO, the Perceiver IO for the graph structured dataset. We keep the main structure of the Graph Perceiver IO as the Perceiver IO because the Perceiver IO already handles the diverse dataset well, except for the graph structured dataset. The Graph Perceiver IO is a general method, and it can handle diverse datasets such as graph structured data as well as text and images. Comparing the graph neural networks, the Graph Perceiver IO requires a lower complexity, and it can incorporate the local and global information efficiently. We show that Graph Perceiver IO shows competitive results for diverse graph-related tasks, including node classification, graph classification, and link prediction.
翻译:为发展一般情报,对多式机器学习进行了广泛研究。 最近, 显著的多式联运算法、 Perceiver 和 Perceiver IO 显示不同数据集域和任务的竞争性结果。 然而, 最近的工作, Perceiver 和 Perceiver IO 侧重于多种模式, 包括图像、 文本和语言, 并且没有为图形结构数据集做多少研究。 图表是最普遍的数据集之一, 我们可以以图表结构化数据来代表其他数据集, 包括图像、 文本和语音。 图表有一个不同于文本和图像等其他数据集域的相近矩阵。 然而, 最近的工作, Perceiver 和 Perceiver IO 集中处理表层信息、 关联信息以及 罐体位置定位信息信息。 在本研究中, 我们为图表结构化数据集提供了一张Perceiver iO 的主结构化结构化数据结构化的图。 我们保留了图表的复杂结构结构, 如 Perceiver 分类, 因为 Perceiver IO 已经很好地处理过多样化的数据设置的数据设置,, 包括图表结构化的图表结构化的图表 和结构化的图表的图表, 格式化的图表显示, 。