Generative Adversarial Networks (GANs) have become increasingly powerful, generating mind-blowing photorealistic images that mimic the content of datasets they were trained to replicate. One recurrent theme in medical imaging is whether GANs can also be effective at generating workable medical data as they are for generating realistic RGB images. In this paper, we perform a multi-GAN and multi-application study to gauge the benefits of GANs in medical imaging. We tested various GAN architectures from basic DCGAN to more sophisticated style-based GANs on three medical imaging modalities and organs namely : cardiac cine-MRI, liver CT and RGB retina images. GANs were trained on well-known and widely utilized datasets from which their FID score were computed to measure the visual acuity of their generated images. We further tested their usefulness by measuring the segmentation accuracy of a U-Net trained on these generated images. Results reveal that GANs are far from being equal as some are ill-suited for medical imaging applications while others are much better off. The top-performing GANs are capable of generating realistic-looking medical images by FID standards that can fool trained experts in a visual Turing test and comply to some metrics. However, segmentation results suggests that no GAN is capable of reproducing the full richness of a medical datasets.
翻译:医学成像中经常出现的一个主题是,GANs是否也能有效地生成可行的医疗数据,因为它们是产生现实的 RGB 图像。在本文中,我们进行了一项多GAN和多应用研究,以衡量GANs在医学成像中的益处。我们测试了从基础DCGAN到更精密的基于风格的GAN的各种GAN结构,在三种医学成像模式和器官上,即:心脏病-MRI、肝脏CT和RGB retina图像上,模仿了他们所训练的图像内容。在医学成像模型和器官上,GANs的高级和广泛利用数据集被训练为众所周知的数据集,从中计算出他们的FID分数以测量其生成图像的视觉精度。我们进一步测试了这些数据集的分解准确性。结果显示,GANs远不能等同一些不适合医学成像应用,而另一些则远不如其他。GANs最能表现的模型测试了GAN的精确度标准,因此,GANs 能够通过直观的模型再测试,从而显示GAN的精确的模型测试结果。