Computational reproducibility is a growing problem that has been extensively studied among computational researchers and within the signal processing and machine learning research community. However, with the changing landscape of signal processing and machine learning research come new obstacles and unseen challenges in creating reproducible experiments. Due to these new challenges most computational experiments have become difficult, if not impossible, to be reproduced by an independent researcher. In 2016 a survey conducted by the journal Nature found that 50% of researchers were unable to reproduce their own experiments. While the issue of computational reproducibility has been discussed in the literature and specifically within the signal processing community, it is still unclear to most researchers what are the best practices to ensure reproducibility without impinging on their primary responsibility of conducting research. We feel that although researchers understand the importance of making experiments reproducible, the lack of a clear set of standards and tools makes it difficult to incorporate good reproducibility practices in most labs. It is in this regard that we aim to present signal processing researchers with a set of practical tools and strategies that can help mitigate many of the obstacles to producing reproducible computational experiments.
翻译:计算再生是计算研究人员和信号处理和机器学习研究界广泛研究的一个日益严重的问题,但随着信号处理和机器学习研究不断变化的格局,在创造可复制的实验方面出现了新的障碍和不可见的挑战。由于这些新的挑战,大多数计算实验都变得难以(甚至不可能)由独立研究人员复制。2016年,《自然》杂志进行的一项调查发现,50%的研究人员无法复制自己的实验。虽然计算再生的问题已在文献中讨论过,特别是在信号处理界中讨论过,但大多数研究人员仍然不清楚哪些最佳做法可以确保再生,同时又不辜负其开展研究的主要责任。我们感到,尽管研究人员理解使实验可复制的重要性,但缺乏一套明确的标准和工具使得难以将良好的再生实践纳入大多数实验室。在这方面,我们的目标是向信号处理研究人员提供一套实用的工具和战略,帮助减少产生可复制的计算实验的许多障碍。