Author name ambiguity remains a critical open problem in digital libraries due to synonymy and homonymy of names. In this paper, we propose a novel approach to link author names to their real-world entities by relying on their co-authorship pattern and area of research. Our supervised deep learning model identifies an author by capturing his/her relationship with his/her co-authors and area of research, which is represented by the titles and sources of the target author's publications. These attributes are encoded by their semantic and symbolic representations. To this end, Bib2Auth uses ~ 22K bibliographic records from the DBLP repository and is trained with each pair of co-authors. The extensive experiments have proved the capability of the approach to distinguish between authors sharing the same name and recognize authors with different name variations. Bib2Auth has shown good performance on a relatively large dataset, which qualifies it to be directly integrated into bibliographic indices.
翻译:作者姓名的模糊性仍然是数字图书馆中一个至关重要的未决问题,原因是名称的同义性和同义性。 在本文中,我们提出一种新的方法,将作者姓名与其真实世界实体联系起来,依靠其共同作者的模式和研究领域。我们监督的深层次学习模式通过捕捉作者与其共同作者的关系和研究领域(由目标作者出版物的名称和来源所代表)来识别作者。这些属性由它们的语义和象征性的表达形式来编码。为此,Bib2Auth使用了来自DBLP存储处的22K文献记录,并经过每对共同作者的培训。广泛的实验证明,能够区分拥有相同名称的作者,并识别不同名称的作者。Bib2Auth在相对庞大的数据集上表现良好,因此有资格将其直接纳入文献索引。