Generative Adversarial Networks (GAN) have many potential medical imaging applications, including data augmentation, domain adaptation, and model explanation. Due to the limited memory of Graphical Processing Units (GPUs), most current 3D GAN models are trained on low-resolution medical images, these models either cannot scale to high-resolution or are prone to patchy artifacts. In this work, we propose a novel end-to-end GAN architecture that can generate high-resolution 3D images. We achieve this goal by using different configurations between training and inference. During training, we adopt a hierarchical structure that simultaneously generates a low-resolution version of the image and a randomly selected sub-volume of the high-resolution image. The hierarchical design has two advantages: First, the memory demand for training on high-resolution images is amortized among sub-volumes. Furthermore, anchoring the high-resolution sub-volumes to a single low-resolution image ensures anatomical consistency between sub-volumes. During inference, our model can directly generate full high-resolution images. We also incorporate an encoder with a similar hierarchical structure into the model to extract features from the images. Experiments on 3D thorax CT and brain MRI demonstrate that our approach outperforms state of the art in image generation. We also demonstrate clinical applications of the proposed model in data augmentation and clinical-relevant feature extraction.
翻译:由于图形处理器(GPUs)的记忆有限,目前大多数的3D GAN模型都以低分辨率医学图像为培训对象,这些模型要么不能规模到高分辨率,要么容易形成偏差的文物。在这项工作中,我们建议建立一个新型端到端GAN结构,可以生成高分辨率的3D图像。我们通过在培训和推断之间使用不同的配置来实现这一目标。在培训过程中,我们采用一个等级结构,同时生成低分辨率图像版本和随机选择的高分辨率图像子卷。等级设计有两个优点:第一,高分辨率图像培训的记忆需求在子卷之间摊合。此外,将高分辨率子卷固定在单一低分辨率图像上,确保子卷之间的解析一致性。在测试过程中,我们的模型可以直接生成全高分辨率图像。我们还将一个具有类似级别结构的刻度模型和高分辨率图像子卷随机选编成。我们还将一个具有类似级别结构的高级分辨率结构的内嵌入模型,并在模型中演示模型中的模型。