We address the task of domain adaptation in object detection, where there is a domain gap between a domain with annotations (source) and a domain of interest without annotations (target). As an effective semi-supervised learning method, the teacher-student framework (a student model is supervised by the pseudo labels from a teacher model) has also yielded a large accuracy gain in cross-domain object detection. However, it suffers from the domain shift and generates many low-quality pseudo labels (\textit{e.g.,} false positives), which leads to sub-optimal performance. To mitigate this problem, we propose a teacher-student framework named Adaptive Teacher (AT) which leverages domain adversarial learning and weak-strong data augmentation to address the domain gap. Specifically, we employ feature-level adversarial training in the student model, allowing features derived from the source and target domains to share similar distributions. This process ensures the student model produces domain-invariant features. Furthermore, we apply weak-strong augmentation and mutual learning between the teacher model (taking data from the target domain) and the student model (taking data from both domains). This enables the teacher model to learn the knowledge from the student model without being biased to the source domain. We show that AT demonstrates superiority over existing approaches and even Oracle (fully-supervised) models by a large margin. For example, we achieve 50.9% (49.3%) mAP on Foggy Cityscape (Clipart1K), which is 9.2% (5.2%) and 8.2% (11.0%) higher than previous state-of-the-art and Oracle, respectively.
翻译:在目标检测中,我们处理域适应任务,即带有说明(源)的域与无说明(目标)的域间差距。作为一个有效的半监督的学习方法,师生框架(一个学生模型由教师模型的假标签监督)在跨域对象检测中也取得了很大的准确性收益。然而,它受到域变换的影响,产生了许多低质量的假标签(\ textit{e.g.}假正数),导致亚最佳业绩。为缓解这一问题,我们建议了一个叫作适应教师(AT)的师生框架,它利用域对抗性学习和弱强数据增强来缩小域差距。具体地说,我们在学生模型中采用特级对抗性培训,让来源和目标域的特征分享类似的分布。这一过程确保学生模型产生域变量变异性特征。此外,我们采用弱的增益和在教师模型(从目标域域的数据)和学生模型之间相互学习(从目标域域里为11.0)以及学生模型(从目标域里为0.9,我们从标本到大域域里的数据,我们用亚优的模型来学习。我们用现有的源底位模型来展示。我们用现有的源来学习。我们用原位模型来显示现有的源到基础。我们用原位的模型,或实验性模型,用原位的模型,用原位的模型来学习。(我们用原位的模型来显示的模型,用原位的模型来显示的模型,用原位的模型,用原位的模型,用原位的模型,用。