The COVID-19 pandemic has driven ever-greater demand for tools which enable efficient exploration of biomedical literature. Although semi-structured information resulting from concept recognition and detection of the defining elements of clinical trials (e.g. PICO criteria) has been commonly used to support literature search, the contributions of this abstraction remain poorly understood, especially in relation to text-based retrieval. In this study, we compare the results retrieved by a standard search engine with those filtered using clinically-relevant concepts and their relations. With analysis based on the annotations from the TREC-COVID shared task, we obtain quantitative as well as qualitative insights into characteristics of relational and concept-based literature exploration. Most importantly, we find that the relational concept selection filters the original retrieved collection in a way that decreases the proportion of unjudged documents and increases the precision, which means that the user is likely to be exposed to a larger number of relevant documents.
翻译:COVID-19大流行促使对有助于有效探索生物医学文献的工具的需求日益增加,尽管由于概念承认和检测临床试验的界定要素(如石化组织标准)而形成的半结构化信息被普遍用于支持文献搜索,但这种抽象性的贡献仍然不甚为人理解,特别是在基于文本的检索方面。在本研究中,我们比较标准搜索引擎检索的结果与使用临床相关概念及其关系过滤的结果。根据TREC-COVID共同任务的说明进行的分析,我们对关系和基于概念的文献探索的特点进行了定量和定性的深入了解。最重要的是,我们发现,关系概念选择过滤了原始检索的收集,从而降低了未判断文件的比例,提高了精确度,这意味着用户可能接触到更多相关文件。