Project title: Research on Name Disambiguation of Chinese Authors in Chinese- and English-Language Papers
Project number: No.71473236
Project type: General Program (面上项目)
Year of establishment/approval: 2015
Project discipline: Management Science
Project author: Yuan Junpeng (袁军鹏)
Author affiliation: Institute of Scientific and Technical Information of China (中国科学技术信息研究所)
Project funding: CNY 590,000 (59万元)
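Chinese abstract (translated): Author name disambiguation is one of the basic problems that science and technology evaluation, scientometrics, digital libraries, information retrieval, and related fields urgently need solved, yet it remains open. More and more Chinese scholars publish papers in both Chinese and English, but once Chinese authors' names are transliterated and abbreviated in English, name collisions become even more severe, which makes disambiguating Chinese authors' Chinese and English names more complex and difficult. This project proposes intelligent algorithms for finding the true authors of papers published by Chinese authors who share a Chinese name or share an English name. The algorithms mainly include an author name disambiguation algorithm based on uniqueness features in Chinese and English papers, and a name disambiguation algorithm based on an improved co-authorship network and the evolution of authors' research fields. When disambiguating English names, information from Chinese papers is integrated to reduce the size of the English same-name data set and to improve disambiguation efficiency. Most of these problems are new explorations in this field and are of considerable significance for developing and refining the theory and methods of author name disambiguation. Solving the problem can push scientometrics-based evaluation and literature retrieval down to the micro level of the individual, and can provide more accurate data support for discipline development, research evaluation, output analysis, institutional measurement, talent evaluation, management of research results, and information search. It has a broad application background and development prospects.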
Chinese keywords (translated): Name disambiguation;Co-authorship network;Citation analysis;Scientometrics
English abstract: For any work of literature, a fundamental issue is to identify the individual(s) who wrote it and, conversely, to identify all of the works that belong to a given individual. Attribution would seem to be a simple process, yet it remains a major unsolved problem for information science. Identifying a Chinese author from an English name is more difficult still. This project focuses on name disambiguation for Chinese authors who write both Chinese and English papers. We analyze the characteristics of papers and of authorship and, taking the properties of existing algorithms into account, design machine learning algorithms. The specific work includes: author name disambiguation based on unique features; author name disambiguation based on the evolution of authors' research fields and on co-author networks; and the integration of existing Chinese-language information to assist English author name disambiguation, in particular to identify authors who share the same English name across different data sets, so as to reduce the size of the same-name English data set. This problem has been a research focus of information science, bibliometrics, web search, natural language processing, and information extraction in recent years. Solving it would advance bibliometrics-based literature retrieval and evaluation to the level of the individual and could provide data to support personnel evaluation and the prevention of academic misconduct and fraud. It has a wide application background and development prospects.
English keywords: Name Disambiguation;Co-author Networks;Citation Analysis;Scientometrics
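
Illustrative sketch (not part of the project record): the abstract names two kinds of evidence for deciding that papers bearing the same name belong to the same person, namely uniqueness features and the co-author network. The Python below is a minimal sketch of that general idea under stated assumptions, not the project's actual algorithm. The function name `disambiguate`, the record fields `id`, `authors`, and `email`, the choice of e-mail address as the uniqueness feature, and the sample data are all hypothetical. Papers are merged with a union-find structure whenever they share an e-mail address or at least one co-author name.

```python
from collections import defaultdict


def disambiguate(papers, target_name):
    """Cluster papers bearing target_name, one cluster per presumed author.

    Assumption: two papers belong to the same person if they share an
    e-mail address (a stand-in uniqueness feature) or a co-author name.
    """
    ids = [p["id"] for p in papers if target_name in p["authors"]]
    parent = {pid: pid for pid in ids}

    def find(x):
        # Find the cluster representative, compressing the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    # Index the candidate papers by each piece of linking evidence.
    by_key = defaultdict(list)
    for p in papers:
        if target_name not in p["authors"]:
            continue
        if p.get("email"):
            by_key[("email", p["email"].lower())].append(p["id"])
        for co in p["authors"]:
            if co != target_name:
                by_key[("coauthor", co)].append(p["id"])

    # Papers that share any evidence key fall into the same cluster.
    for pids in by_key.values():
        for other in pids[1:]:
            union(pids[0], other)

    clusters = defaultdict(list)
    for pid in ids:
        clusters[find(pid)].append(pid)
    return sorted(sorted(c) for c in clusters.values())


# Hypothetical records for the ambiguous English name "Wang, J.".
papers = [
    {"id": "p1", "authors": ["Wang, J.", "Li, H."], "email": "jw@example.org"},
    {"id": "p2", "authors": ["Wang, J.", "Li, H.", "Chen, X."], "email": None},
    {"id": "p3", "authors": ["Wang, J.", "Zhao, Q."], "email": "jw@example.org"},
    {"id": "p4", "authors": ["Wang, J.", "Sun, Y."], "email": None},
]
clusters = disambiguate(papers, "Wang, J.")
print(clusters)  # [['p1', 'p2', 'p3'], ['p4']]
```

In this sketch co-author names are matched as plain strings, so a co-author whose own name is ambiguous can wrongly merge two people. A fuller treatment would presumably weight or filter such links, for example by using information from Chinese-language records as the abstract suggests.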