利用变形测试对堆装的生成反对向网络进行强力评价 (Robustness Evaluation of Stacked Generative Adversarial Networks using Metamorphic Testing)

Synthesising photo-realistic images from natural language is one of the challenging problems in computer vision. Over the past decade, a number of approaches have been proposed, of which the improved Stacked Generative Adversarial Network (StackGAN-v2) has proven capable of generating high resolution images that reflect the details specified in the input text descriptions. In this paper, we aim to assess the robustness and fault-tolerance capability of the StackGAN-v2 model by introducing variations in the training data. However, due to the working principle of Generative Adversarial Network (GAN), it is difficult to predict the output of the model when the training data are modified. Hence, in this work, we adopt Metamorphic Testing technique to evaluate the robustness of the model with a variety of unexpected training dataset. As such, we first implement StackGAN-v2 algorithm and test the pre-trained model provided by the original authors to establish a ground truth for our experiments. We then identify a metamorphic relation, from which test cases are generated. Further, metamorphic relations were derived successively based on the observations of prior test results. Finally, we synthesise the results from our experiment of all the metamorphic relations and found that StackGAN-v2 algorithm is susceptible to input images with obtrusive objects, even if it overlaps with the main object minimally, which was not reported by the authors and users of StackGAN-v2 model. The proposed metamorphic relations can be applied to other text-to-image synthesis models to not only verify the robustness but also to help researchers understand and interpret the results made by the machine learning models.

翻译：以自然语言制作的摄影现实合成图像是计算机视觉中具有挑战性的问题之一。在过去十年中,提出了若干方法,其中经过改进的Staacked General Aversarial网络(StackGAN-v2)证明能够产生高分辨率图像,反映输入文本描述中具体规定的细节。在本文件中,我们的目标是通过引入培训数据的变化来评估StackGAN-v2模型的坚固性和错错觉容忍能力。然而,由于General Aversarial 网络(GAN)的工作原则,在培训数据修改时很难预测模型的产出。因此,在这项工作中,我们采用了变形测试技术,能够用各种出乎意料的培训数据集描述来评价模型的稳健健健。因此,我们首先采用StaackGAN-V2算法,测试原始作者提供的预培训模型,以便为我们的实验建立地面真相模型。我们随后确定了一种变形关系,从中产生了测试案例。此外,在对模型进行变形关系中,甚至根据对模型的变形关系,根据对正态2的模型的观察结果,我们使用的是Statraphical-dealal exalationalationalation realation relation relational