In this study, we dive deep into the inconsistency of pseudo targets in semi-supervised object detection (SSOD). Our core observation is that the oscillating pseudo targets undermine the training of an accurate semi-supervised detector. It not only inject noise into student training but also lead to severe overfitting on the classification task. Therefore, we propose a systematic solution, termed Consistent-Teacher, to reduce the inconsistency. First, adaptive anchor assignment~(ASA) substitutes the static IoU-based strategy, which enables the student network to be resistant to noisy pseudo bounding boxes; Then we calibrate the subtask predictions by designing a 3D feature alignment module~(FAM-3D). It allows each classification feature to adaptively query the optimal feature vector for the regression task at arbitrary scales and locations. Lastly, a Gaussian Mixture Model (GMM) dynamically revises the score threshold of the pseudo-bboxes, which stabilizes the number of ground-truths at an early stage and remedies the unreliable supervision signal during training. Consistent-Teacher provides strong results on a large range of SSOD evaluations. It achieves 40.0 mAP with ResNet-50 backbone given only 10\% of annotated MS-COCO data, which surpasses previous baselines using pseudo labels by around 3 mAP. When trained on fully annotated MS-COCO with additional unlabeled data, the performance further increases to 47.2 mAP. Our code will be open-sourced soon.
翻译:在这项研究中,我们深入探讨半监督物体探测(裁军特别联大)中假目标的不一致性。我们的核心观察是,变形假目标破坏了准确的半监督探测器的培训。它不仅给学生培训注入噪音,而且导致分类任务严重过度。因此,我们提出了一个系统解决方案,称为“一致教学人”,以减少不一致性。首先,基于适应性的锚定任务~(ASA)取代基于静态的IOU战略,使学生网络能够抵抗噪音的伪约束盒;然后,我们通过设计一个3D特征校准模块~(FAM-3D)校准子任务预测。它允许每种分类特征在任意的尺度和地点对回归任务的最佳特性矢量进行适应性查询。最后,一个称为“一致教学师”的模型(GMMMM)动态地修订伪箱的评分阈值阈值阈值,以在早期稳定地面标数,并在培训期间纠正不可靠的监督信号。 一致的教研判官将很快通过一个大范围的AS-ROAP IMO 数据库(AS-ROAP) 完成前的升级数据基准。