Current citation practices observed in articles are very noisy, confusing, and not standardised, making identifying the cited works problematic for hu-mans and any reference extraction software. In this work, we want to investigate such citation practices for referencing different types of entities and, in particular, to understand the most used metadata in bibliographic refer-ences. We identified 36 types of cited entities (the most cited ones were articles, books, and proceeding papers) within the 34,140 bibliographic references extracted from a vast set of journal articles on 27 different subject ar-eas. The analysis of such bibliographic references, grouped by the particular type of cited entities, enabled us to highlight the most used metadata for de-fining bibliographic references across the subject areas. However, we also noticed that, in some cases, bibliographic references did not provide the essential elements to identify the work they refer to easily.
翻译:文章中观察到的当前引用做法非常吵闹、令人困惑,而且没有标准化,因此查明所引用的作品对Human和任何参考提取软件都很成问题。在这项工作中,我们希望调查这些引用做法,以查找不同类型的实体,特别是了解文献参考文献中最常用的元数据。我们在34 140个参考文献中确定了36类引用实体(最引证的是文章、书籍和诉讼文件),这些参考文献摘自关于27个不同主题的众多期刊文章。对此类文献参考文献的分析,按所引用实体的具体类型分类,使我们能够突出用于在主题领域进行脱钩参考文献的最常用的元数据。然而,我们还注意到,在有些情况下,参考文献并未提供基本要素,用以确定它们容易提及的作品。