Peer review is the primary gatekeeper of scientific merit and quality, yet it is prone to bias and suffers from low efficiency. This demands cross-disciplinary scrutiny of the processes that underlie peer reviewing; however, quantitative research is limited by the data availability, as most of the peer reviewing data across research disciplines is never made public. Existing data collection efforts focus on few scientific domains and do not address a range of ethical, license- and confidentiality-related issues associated with peer reviewing data, preventing wide-scale research and application development. While recent methods for peer review analysis and processing show promise, a solid data foundation for computational research in peer review is still missing. To address this, we present an in-depth discussion of peer reviewing data, outline the ethical and legal desiderata for peer reviewing data collection, and propose the first continuous, donation-based data collection workflow that meets these requirements. We report on the ongoing implementation of this workflow at the ACL Rolling Review and deliver the first insights obtained with the newly collected data.
翻译:同行审查是科学优缺点和质量的主要守门人,但它容易产生偏向,效率低。这要求对同行审议所依据的进程进行跨学科审查;然而,定量研究受到数据可得性的限制,因为各研究学科同行审议的数据大多从未公布。现有的数据收集工作侧重于少数科学领域,没有处理与同行审议数据有关的一系列伦理、许可证和保密问题,防止了广泛的研究和应用开发。虽然最近的同行审议分析和处理方法显示前景光明,但同行审议中的计算研究仍然缺乏坚实的数据基础。为了解决这一问题,我们深入讨论同行审议数据,概述同行审议数据收集的道德和法律侧面,并提出符合这些要求的第一个持续、基于捐赠的数据收集工作流程。我们向ACL滚动审查报告这一工作流程的持续执行情况,并提供从新收集的数据中获得的初步见解。