Graph neural networks (GNNs) have been utilized for various natural language processing (NLP) tasks lately. The ability to encode corpus-wide features in graph representation made GNN models popular in various tasks such as document classification. One major shortcoming of such models is that they mainly work on homogeneous graphs, while representing text datasets as graphs requires several node types which leads to a heterogeneous schema. In this paper, we propose a transductive hybrid approach composed of an unsupervised node representation learning model followed by a node classification/edge prediction model. The proposed model is capable of processing heterogeneous graphs to produce unified node embeddings which are then utilized for node classification or link prediction as the downstream task. The proposed model is developed to classify stock market technical analysis reports, which to our knowledge is the first work in this domain. Experiments, which are carried away using a constructed dataset, demonstrate the ability of the model in embedding extraction and the downstream tasks.
翻译:最近,在各种自然语言处理(NLP)任务中使用了图形神经网络(GNNs) 。在图形显示中编码全系统功能的能力使得GNN模型在文件分类等各种任务中很受欢迎。这些模型的一个主要缺点是,这些模型主要是在同质图形上工作,而作为图形代表文本数据集需要几种节点类型,从而导致一个多式的图象。在本文件中,我们建议采用一种转式混合方法,由未经监督的节点代表学习模型组成,然后是节点分类/前置预测模型。拟议的模型能够处理多式图形,以产生统一的节点嵌入,然后作为下游任务用于节点分类或链接预测。拟议的模型是用来对股票市场技术分析报告进行分类的,而我们所了解的是该领域的首项工作。实验正在使用一个已建数据集进行,展示模型嵌入提取和下游任务的能力。