The exponential growth of scientific literature makes secondary literature abridgements increasingly demanding. We introduce a new open-source framework for systematic reviews that significantly reduces time and human resource allocation for collecting and screening scientific literature. The framework provides three main tools: 1) an automatic citation search engine and manager that collects records from multiple online sources with a unified query syntax, 2) a Bayesian, active machine learning, citation screening tool based on iterative human-machine interaction to increase predictive accuracy and, 3) a semi-automatic, data-driven query generator to create new search queries from existing citation data sets. To evaluate the automatic screener's performance, we estimated the median posterior sensitivity and efficiency [90% Credible Intervals] using Bayesian simulation to predict the distribution of undetected potentially relevant records. Tested on an example topic, the framework collected 17,755 unique records through the citation manager; 766 records required human evaluation while the rest were excluded by the automatic classifier; the theoretical efficiency was 95.6% [95.3%, 95.7%] with a sensitivity of 100% [93.5%, 100%]. A new search query was generated from the labelled dataset, and 82,579 additional records were collected; only 567 records required human review after automatic screening, and six additional positive matches were found. The overall expected sensitivity decreased to 97.3% [73.8%, 100%] while the efficiency increased to 98.6% [98.2%, 98.7%]. For large studies, the framework can significantly reduce the human resources required to conduct literature reviews by simplifying citation collection and screening while demonstrating exceptional sensitivity. Such a tool can improve the standardization and repeatability of systematic reviews.
翻译:科学文献的指数式增长使得二次文献的缩略要求越来越高。 我们为系统审查引入了新的开放源码框架, 大幅缩短了收集和筛选科学文献的时间和人力资源分配。 框架提供了三大工具:(1) 自动引用搜索引擎和管理员, 以统一的查询语法从多个在线来源收集记录, 2) 巴伊西亚语、 积极的机器学习、 引用筛选工具, 以迭接的人体机器互动为基础, 提高预测准确性; 3) 半自动数据驱动的查询生成器, 以从现有的引用数据集中创建新的搜索查询查询。 为了评估自动筛选器的性能, 我们估算了中位的海报敏感度和效率[90% 。 [90%的可识别性 。 使用巴伊西亚语模拟来预测未察觉的潜在相关记录的分布。 测试了一个实例, 框架收集了17 755个独特的记录, 需要766个记录, 而其他记录则被自动分类排除; 理论效率为95. 2% (9.3%, 95.7 % 和95., 的理论效率, 的敏感性为[93.5 %, 100 % 。