We present a forecasting analysis on the growth of scientific literature related to COVID-19 expected for 2021. Considering the paramount scientific and financial efforts made by the research community to find solutions to end the COVID-19 pandemic, an unprecedented volume of scientific outputs is being produced. This questions the capacity of scientists, politicians and citizens to maintain infrastructure, digest content and take scientifically informed decisions. A crucial aspect is to make predictions to prepare for such a large corpus of scientific literature. Here we base our predictions on the ARIMA model and use two different data sources: the Dimensions and World Health Organization COVID-19 databases. These two sources have the particularity of including in the metadata information the date in which papers were indexed. We present global predictions, plus predictions in three specific settings: type of access (Open Access), NLM source (PubMed and PMC), and domain-specific repository (SSRN and MedRxiv). We conclude by discussing our findings.
翻译:考虑到研究界为寻找结束COVID-19大流行的解决方案作出了重大的科学和财政努力,目前正在产生前所未有的科学产出,这就对科学家、政治家和公民维持基础设施、消化内容和作出科学知情决定的能力提出了疑问,一个关键方面是作出预测,以便为如此大量的科学文献作准备。我们在这里以ARIMA模型为基础作出预测,并使用两个不同的数据来源:尺寸和世界卫生组织COVID-19数据库。这两个来源在元数据信息中特别包括文件编制索引的日期。我们提出全球预测,加上在三种特定情况下的预测:准入类型(开放存取)、NLM来源(普梅德和PMC)和具体领域储存(SSRN和MedRxiv)。我们最后通过讨论我们的结论。