Deep neural networks have emerged as very successful tools for image restoration and reconstruction tasks. These networks are often trained end-to-end to directly reconstruct an image from a noisy or corrupted measurement of that image. To achieve state-of-the-art performance, training on large and diverse sets of images is considered critical. However, it is often difficult and/or expensive to collect large amounts of training images. Inspired by the success of Data Augmentation (DA) for classification problems, in this paper, we propose a pipeline for data augmentation for accelerated MRI reconstruction and study its effectiveness at reducing the required training data in a variety of settings. Our DA pipeline, MRAugment, is specifically designed to utilize the invariances present in medical imaging measurements as naive DA strategies that neglect the physics of the problem fail. Through extensive studies on multiple datasets we demonstrate that in the low-data regime DA prevents overfitting and can match or even surpass the state of the art while using significantly fewer training data, whereas in the high-data regime it has diminishing returns. Furthermore, our findings show that DA can improve the robustness of the model against various shifts in the test distribution.
翻译:深神经网络已成为图像恢复和重建任务非常成功的工具,这些网络往往经过培训,从噪音或腐败的测量中直接重建图像。为了实现最先进的性能,对大型和多种图像进行培训被认为是至关重要的。然而,收集大量培训图像往往很困难和(或)昂贵。由于数据增强(DA)成功地解决分类问题,我们在本文件中提议为加速MRI重建提供数据增强管道,并研究其在减少各种环境所需培训数据方面的效力。我们的DA管道MRAugment专门设计了将医学成像测量中存在的差异用作忽视问题物理的天真的DA战略。通过对多个数据集的广泛研究,我们证明DA在低数据系统中防止过度装配,并且能够匹配甚至超过艺术状态,同时使用的培训数据要少得多,而在高数据系统中,它减少了回报。此外,我们的研究结果表明,DA可以改进模型的稳健性,防止测试分布的各种变化。