跨越多个域域进行不受监督的高效适应的知识蒸馏方法 (Knowledge Distillation Methods for Efficient Unsupervised Adaptation Across Multiple Domains)

Beyond the complexity of CNNs that require training on large annotated datasets, the domain shift between design and operational data has limited the adoption of CNNs in many real-world applications. For instance, in person re-identification, videos are captured over a distributed set of cameras with non-overlapping viewpoints. The shift between the source (e.g. lab setting) and target (e.g. cameras) domains may lead to a significant decline in recognition accuracy. Additionally, state-of-the-art CNNs may not be suitable for such real-time applications given their computational requirements. Although several techniques have recently been proposed to address domain shift problems through unsupervised domain adaptation (UDA), or to accelerate/compress CNNs through knowledge distillation (KD), we seek to simultaneously adapt and compress CNNs to generalize well across multiple target domains. In this paper, we propose a progressive KD approach for unsupervised single-target DA (STDA) and multi-target DA (MTDA) of CNNs. Our method for KD-STDA adapts a CNN to a single target domain by distilling from a larger teacher CNN, trained on both target and source domain data in order to maintain its consistency with a common representation. Our proposed approach is compared against state-of-the-art methods for compression and STDA of CNNs on the Office31 and ImageClef-DA image classification datasets. It is also compared against state-of-the-art methods for MTDA on Digits, Office31, and OfficeHome. In both settings -- KD-STDA and KD-MTDA -- results indicate that our approach can achieve the highest level of accuracy across target domains, while requiring a comparable or lower CNN complexity.

翻译：除了需要大规模附加说明数据集培训的CNN的复杂性之外,设计和操作数据之间的域变也限制了在很多现实世界应用程序中采用CNN。例如,在个人再识别方面,视频通过一组分布式摄影机拍摄,带有非重叠观点。源域(如实验室设置)和目标(如相机)域的转变可能导致识别准确性大幅下降。此外,鉴于计算要求,最先进的CNN可能不适合这类实时应用程序。虽然最近提出了一些技术,通过不受监督的域变换(UDA)解决CNN的域变换问题,或通过知识蒸馏(KD)来加速/压缩CNN的视频。我们试图同时调整和压缩CNN(如实验室设置)和目标(如照相机)域之间的变化可能导致识别准确性显著下降。我们提议了一种渐进式KDD(STDA)和多目标DA(MDA(MDA)方法)。我们KDA(KDA-DA)的方法是将CNN调整到一个单一目标域,通过比KMDA(KDA)更精确的域域域局,同时保持我们所训练的域域域域域域局的域域域域内的数据系统,也显示一个比KMDA(KMDA)的域域域域域域域内的数据系统,在比较比比的域域域数据系统(KDA)的标准化数据系统)的比较。