In 2020 the Australia New Zealand Standard Research Classification Fields of Research Codes (ANZSRC FoR codes) were updated by their owners. This has led the sector to need to update their systems of reference and has caused suppliers working in the research information sphere to need to update both systems and data. This paper describes the approach developed by Digital Science's Dimensions team to the creation of an improved machine learning training set, and the mapping of that set from FoR 2008 codes to FoR 2020 codes so that Dimensions classification approach for the ANZSRC codes could be improved and updated.
翻译:2020年,澳大利亚-新西兰标准研究分类领域研究守则(ANZSRC FOR代码)由所有者更新,导致该部门需要更新其参考系统,并使研究信息领域的供应商需要更新系统和数据,本文介绍了数字科学层面小组为创建经改进的机器学习培训成套方法,并将这套方法从2008年的FOR代码绘制为2020年的FOR代码,以便改进和更新ANZSRC代码的尺寸分类方法。