We study the problem of learning to reason in large scale knowledge graphs (KGs). More specifically, we describe a novel reinforcement learning framework for learning multi-hop relational paths: we use a policy-based agent with continuous states based on knowledge graph embeddings, which reasons in a KG vector space by sampling the most promising relation to extend its path. In contrast to prior work, our approach includes a reward function that takes the accuracy, diversity, and efficiency into consideration. Experimentally, we show that our proposed method outperforms a path-ranking based algorithm and knowledge graph embedding methods on Freebase and Never-Ending Language Learning datasets.
翻译:我们研究大规模知识图表(KGs)中学习理性的问题。更具体地说,我们描述一个用于学习多希望关系路径的新强化学习框架:我们使用基于政策的工具,根据知识图表嵌入的连续状态,这是在KG矢量空间中通过取样最有希望的关系扩展路径的原因。与以往的工作不同,我们的方法包括一种奖励功能,该功能考虑到准确性、多样性和效率。实验性地说,我们提出的方法比基于路径的算法和知识图表嵌入自由基础和永不停止语言学习数据集的方法要好。