Gradient-based attention modeling has been used widely as a way to visualize and understand convolutional neural networks. However, exploiting these visual explanations during the training of generative adversarial networks (GANs) is an unexplored area in computer vision research. Indeed, we argue that this kind of information can be used to influence GANs training in a positive way. For this reason, in this paper, it is shown how gradient based attentions can be used as knowledge to be conveyed in a teacher-student paradigm for multi-domain image-to-image translation tasks in order to improve the results of the student architecture. Further, it is demonstrated how "pseudo"-attentions can also be employed during training when teacher and student networks are trained on different domains which share some similarities. The approach is validated on multi-domain facial attributes transfer and human expression synthesis showing both qualitative and quantitative results.
翻译:以渐进为基础的关注模型被广泛用作视觉化和理解进化神经网络的一种方法,然而,在培训基因对抗网络时利用这些直观解释是计算机视觉研究中尚未探索的领域。事实上,我们争辩说,这种信息可以正面地影响GAN培训。为此,本文件显示,基于梯度的关注可以如何用作在教师-学生多多领域图像到图像翻译任务模式中传递的知识,以便改善学生结构的结果。此外,在师生网络接受不同领域培训时,如何在培训过程中使用“假想”的意向也证明了这一点,因为师生网络在不同的领域接受培训,这些领域具有一些相似之处。这种方法在多领域面部面貌属性传输和人文表达合成中得到了验证,显示了质和量两方面的结果。