Denoising diffusion models have recently achieved state-of-the-art performance in many image-generation tasks. They do, however, require a large amount of computational resources. This limits their application to medical tasks, where we often deal with large 3D volumes, like high-resolution three-dimensional data. In this work, we present a number of different ways to reduce the resource consumption for 3D diffusion models and apply them to a dataset of 3D images. The main contribution of this paper is the memory-efficient patch-based diffusion model \textit{PatchDDM}, which can be applied to the total volume during inference while the training is performed only on patches. While the proposed diffusion model can be applied to any image generation tasks, we evaluate the method on the tumor segmentation task of the BraTS2020 dataset and demonstrate that we can generate meaningful three-dimensional segmentations.
翻译:最近,去噪扩散模型在许多图像生成任务中已经达到了最先进的表现。然而,它们需要大量的计算资源,因此限制了它们在医疗任务中的应用,而在医疗任务中我们经常处理大量的三维体积,如高分辨率三维数据。在这项工作中,我们提出了减少3D扩散模型资源消耗的多种方法,并将它们应用到一个包含3D图像的数据集上。本文主要贡献是内存有效的基于补丁的扩散模型PatchDDM,可以在推断期间对整个体积进行应用,而只在补丁上进行训练。虽然所提出的扩散模型可以应用于任何图像生成任务,但我们在BraTS2020数据集的肿瘤分割任务上评估了该方法,证明我们可以生成有意义的三维分割。