Identifying the research topics that best describe the scope of a scientific publication is a crucial task for editors, in particular because the quality of these annotations determines how effectively users are able to discover the right content in online libraries. For this reason, Springer Nature, the world's largest academic book publisher, has traditionally entrusted this task to its most expert editors. These editors manually analyse all new books, possibly including hundreds of chapters, and produce a list of the most relevant topics. Hence, this process has traditionally been very expensive, time-consuming, and confined to a few senior editors. For these reasons, back in 2016 we developed Smart Topic Miner (STM), an ontology-driven application that assists the Springer Nature editorial team in annotating the volumes of all books covering conference proceedings in Computer Science. Since then, STM has been regularly used by editors in Germany, China, Brazil, India, and Japan, for a total of about 800 volumes per year. Over the past three years, the initial prototype has iteratively evolved in response to user feedback and changing requirements. In this paper we present the most recent version of the tool and describe the evolution of the system over the years, the key lessons learnt, and the impact on the Springer Nature workflow. In particular, our solution has drastically reduced the time needed to annotate proceedings and significantly improved their discoverability, resulting in 9.3 million additional downloads. We also present a user study involving 9 editors, which yielded excellent results in terms of usability, and report an evaluation of the new topic classifier used by STM, which outperforms previous versions in recall and F-measure.