Predicting the impact of publications in science and technology has become an important research area, which is useful in various real world scenarios such as technology investment, research direction selection, and technology policymaking. Citation trajectory prediction is one of the most popular tasks in this area. Existing approaches mainly rely on mining temporal and graph data from academic articles. Some recent methods are capable of handling cold-start prediction by aggregating metadata features of new publications. However, the implicit factors causing citations and the richer information from handling temporal and attribute features still need to be explored. In this paper, we propose CTPIR, a new citation trajectory prediction framework that is able to represent the influence (the momentum of citation) of either new or existing publications using the history information of all their attributes. Our framework is composed of three modules: difference-preserved graph embedding, fine-grained influence representation, and learning-based trajectory calculation. To test the effectiveness of our framework in more situations, we collect and construct a new temporal knowledge graph dataset from the real world, named AIPatent, which stems from global patents in the field of artificial intelligence. Experiments are conducted on both the APS academic dataset and our contributed AIPatent dataset. The results demonstrate the strengths of our approach in the citation trajectory prediction task.
翻译:预测科技出版物的影响已成为一个重要的研究领域,在技术投资、研究方向选择和技术决策等各种现实世界情景中都有用。引用轨迹预测是该领域最受欢迎的任务之一。现有方法主要依靠学术文章中的采矿时间和图表数据。一些最新方法能够通过汇总新出版物的元数据特征处理冷启动预测。然而,仍然需要探讨导致引证的隐含因素以及处理时间和属性特征特征的更丰富信息。在本文件中,我们提议CTPIR,这是一个新的引用轨迹预测框架,能够代表使用其所有属性的历史信息的新出版物或现有出版物的影响(引用的势头)。我们的框架由三个模块组成:有差异的图形嵌入、微微影响代表以及基于学习的轨迹计算。为了检验我们框架在更多情况下的有效性,我们从真实世界收集并建立一个新的时间知识图表数据集,名为AIPatent,该数据集来自人工智能学领域的全球专利。对APS学术预测方法进行实验,展示了我们轨迹预测的结果。