This literature review identifies indicators that associate with higher impact or higher quality research from article text (e.g., titles, abstracts, lengths, cited references and readability) or metadata (e.g., the number of authors, international or domestic collaborations, journal impact factors and authors' h-index). This includes studies that used machine learning techniques to predict citation counts or quality scores for journal articles or conference papers. The literature review also includes evidence about the strength of association between bibliometric indicators and quality score rankings from previous UK Research Assessment Exercises (RAEs) and REFs in different subjects and years and similar evidence from other countries (e.g., Australia and Italy). In support of this, the document also surveys studies that used public datasets of citations, social media indictors or open review texts (e.g., Dimensions, OpenCitations, Altmetric.com and Publons) to help predict the scholarly impact of articles. The results of this part of the literature review were used to inform the experiments using machine learning to predict REF journal article quality scores, as reported in the AI experiments report for this project. The literature review also covers technology to automate editorial processes, to provide quality control for papers and reviewers' suggestions, to match reviewers with articles, and to automatically categorise journal articles into fields. Bias and transparency in technology assisted assessment are also discussed.
翻译:文献审查还查明了与文章文本(如标题、摘要、长度、引用的引用和可读性)或元数据(如作者人数、国际或国内协作、期刊影响因素和作者h-指数)或元数据(如作者人数、国际或国内协作、期刊影响因素和作者h-指数)的更高影响或高质量研究相关联的指标,其中包括利用机器学习技术预测期刊文章或会议文件的引用计数或质量分数(如尺寸、公开标签、Altectric.com和Publons)来帮助预测文章的学术影响的证据。文献审查的这一部分成果被用来指导利用机器学习来预测REF期刊质量分数的实验,如在内部审计员、社会媒体指标或公开审查文本(如尺寸、公开标签、Altimas.com和Publblons)使用公共数据集(例如澳大利亚和意大利)的研究,此外,文件还利用机器学习来为项目质量审评员提供数据,并自动向数据审评员提供质量评估报告。