Generative adversarial networks (GANs) have shown significant potential in modeling high dimensional distributions of image data, especially on image-to-image translation tasks. However, due to the complexity of these tasks, state-of-the-art models often contain a tremendous amount of parameters, which results in large model size and long inference time. In this work, we propose a novel method to address this problem by applying knowledge distillation together with distillation of a semantic relation preserving matrix. This matrix, derived from the teacher's feature encoding, helps the student model learn better semantic relations. In contrast to existing compression methods designed for classification tasks, our proposed method adapts well to the image-to-image translation task on GANs. Experiments conducted on 5 different datasets and 3 different pairs of teacher and student models provide strong evidence that our methods achieve impressive results both qualitatively and quantitatively.
翻译:然而,由于这些任务的复杂性,最先进的模型往往包含大量参数,从而导致巨大的模型大小和长期的推导时间。在这项工作中,我们提出了一个新颖的方法,通过应用知识蒸馏和提炼语义关系保护矩阵来解决这一问题。这个矩阵来自教师的特征编码,有助于学生模型学习更好的语义关系。与为分类任务设计的现有的压缩方法相比,我们建议的方法非常适合GAN的图像到图像翻译任务。在5个不同的数据集和3个不同的师生模型上进行的实验提供了有力的证据,证明我们的方法在质量和数量上都取得了令人印象深刻的成果。