Concerns about the reproducibility of deep learning research are more prominent than ever, with no clear solution in sight. The relevance of machine learning research can only be improved if we also employ empirical rigor that incorporates reproducibility guidelines, especially so in the medical imaging field. The Medical Imaging with Deep Learning (MIDL) conference has made advancements in this direction by advocating open access, and recently also recommending authors to make their code public - both aspects being adopted by the majority of the conference submissions. This helps the reproducibility of the methods, however, there is currently little or no support for further evaluation of these supplementary material, making them vulnerable to poor quality, which affects the impact of the entire submission. We have evaluated all accepted full paper submissions to MIDL between 2018 and 2022 using established, but slightly adjusted guidelines on reproducibility and the quality of the public repositories. The evaluations show that publishing repositories and using public datasets are becoming more popular, which helps traceability, but the quality of the repositories has not improved over the years, leaving room for improvement in every aspect of designing repositories. Merely 22% of all submissions contain a repository that were deemed repeatable using our evaluations. From the commonly encountered issues during the evaluations, we propose a set of guidelines for machine learning-related research for medical imaging applications, adjusted specifically for future submissions to MIDL.
翻译:深层学习医学成像(MIDL)会议倡导开放访问,最近还建议作者公开其守则,大多数会议文件都采纳了这两个方面,有助于这些方法的推广,但是,目前很少或根本没有支持进一步评价这些补充材料,使其质量低劣,影响到整个划界案的影响,因此,机器学习研究的相关性才能得到改善。我们评估了2018年至2022年期间所有被接受的向MIDL提交的完整文件,采用了既定但略有调整的关于可复制性和公共储存库质量的准则。评价显示,出版库和使用公共数据集越来越受欢迎,有助于追踪,但储存库的质量多年来没有改善,在设计储存库的每个方面都有改进的余地。从所有提交文件的22%到今后提出的与MIDL有关的共同研究都含有一个经过调整的档案库,供我们今后使用的与ML有关的研究之用。