Generative adversarial networks are a kind of deep generative model with the potential to revolutionize biomedical imaging. This is because GANs have a learned capacity to draw whole-image variates from a lower-dimensional representation of an unknown, high-dimensional distribution that fully describes the input training images. The overarching problem with GANs in clinical applications is that there is not adequate or automatic means of assessing the diagnostic quality of images generated by GANs. In this work, we demonstrate several tests of the statistical accuracy of images output by two popular GAN architectures. We designed several stochastic object models (SOMs) of distinct features that can be recovered after generation by a trained GAN. Several of these features are high-order, algorithmic pixel-arrangement rules which are not readily expressed in covariance matrices. We designed and validated statistical classifiers to detect the known arrangement rules. We then tested the rates at which the different GANs correctly reproduced the rules under a variety of training scenarios and degrees of feature-class similarity. We found that ensembles of generated images can appear accurate visually, and correspond to low Frechet Inception Distance scores (FID), while not exhibiting the known spatial arrangements. Furthermore, GANs trained on a spectrum of distinct spatial orders did not respect the given prevalence of those orders in the training data. The main conclusion is that while low-order ensemble statistics are largely correct, there are numerous quantifiable errors per image that plausibly can affect subsequent use of the GAN-generated images.
翻译:产生对抗性网络是一种深层次的基因模型,有可能使生物医学成像革命化。这是因为GANs具有从一个不为人知的、高度分布式的低维中绘制全象变异体的学习能力,它充分描述了投入培训图像。临床应用中GANs的首要问题是,没有足够或自动的手段来评估GANs产生的图像的诊断质量。在这项工作中,我们展示了两个广受欢迎的GAN结构的图像输出的统计精确度的若干测试。我们设计了几个不同特性的随机对象模型(SOMS),这些模型在经过培训的GAN生成后可以从一个低维度分布式的图像中回收。这些特征中有些是高端的、算法像素定式规则,在变异矩阵中不易表达出来。我们设计并验证了统计分类方法,以探测GANs产生的图像的诊断质量。我们测试了不同GANs在各种培训情景下正确复制规则的速度和特征类相似程度。我们发现,生成的图像组合可以看得准确,基本上反映低度的GANsel-Aximles massimal的图像,而后来的精确度则没有进行精确测测测测测测。