High-quality medical systematic reviews require comprehensive literature searches to ensure the recommendations and outcomes are sufficiently reliable. Indeed, searching for relevant medical literature is a key phase in constructing systematic reviews and often involves domain (medical researchers) and search (information specialists) experts in developing the search queries. Queries in this context are highly complex, based on Boolean logic, include free-text terms and index terms from standardised terminologies (e.g., the Medical Subject Headings (MeSH) thesaurus), and are difficult and time-consuming to build. The use of MeSH terms, in particular, has been shown to improve the quality of the search results. However, identifying the correct MeSH terms to include in a query is difficult: information experts are often unfamiliar with the MeSH database and unsure about the appropriateness of MeSH terms for a query. Naturally, the full value of the MeSH terminology is often not fully exploited. This article investigates methods to suggest MeSH terms based on an initial Boolean query that includes only free-text terms. In this context, we devise lexical and pre-trained language models based methods. These methods promise to automatically identify highly effective MeSH terms for inclusion in a systematic review query. Our study contributes an empirical evaluation of several MeSH term suggestion methods. We further contribute an extensive analysis of MeSH term suggestions for each method and how these suggestions impact the effectiveness of Boolean queries.
翻译:高质量的医疗系统审查需要全面的文献搜索,以确保建议和结果足够可靠。事实上,寻找相关的医疗文献是建立系统审查的关键阶段,往往涉及领域(医学研究人员)和搜索(信息专家)专家来进行搜索查询。在这方面,根据布莱安逻辑,查询非常复杂,包括来自标准化术语(如医学主题标题术语)的免费文本术语和索引术语(如医学主题术语词库)的免费文本术语和索引术语,而且开发起来既困难又费时。特别是,使用MesH术语已经表明可以提高搜索结果的质量。然而,在查询中找到正确的MesH术语是困难的:信息专家往往不熟悉MesH数据库,不能确定MesH术语在查询中的适当性。自然,MesH术语的全部价值往往没有得到充分利用。本文章调查了根据初步布尔兰查询(仅包括自由文字术语)提出MeS术语建议的方法。在这方面,我们设计了词汇和预培训前语言术语的正确性术语,很难在查询中找到一种以系统化语言为基础的方法。我们用什么方法来进一步评估方法,我们用一种以系统化语言分析方法来评估。这些术语的术语的术语有助于我们以自动地评估。