Mainstream object detectors are commonly constituted of two sub-tasks, including classification and regression tasks, implemented by two parallel heads. This classic design paradigm inevitably leads to inconsistent spatial distributions between classification score and localization quality (IOU). Therefore, this paper alleviates this misalignment in the view of knowledge distillation. First, we observe that the massive teacher achieves a higher proportion of harmonious predictions than the lightweight student. Based on this intriguing observation, a novel Harmony Score (HS) is devised to estimate the alignment of classification and regression qualities. HS models the relationship between two sub-tasks and is seen as prior knowledge to promote harmonious predictions for the student. Second, this spatial misalignment will result in inharmonious region selection when distilling features. To alleviate this problem, a novel Task-decoupled Feature Distillation (TFD) is proposed by flexibly balancing the contributions of classification and regression tasks. Eventually, HD and TFD constitute the proposed method, named Task-Balanced Distillation (TBD). Extensive experiments demonstrate the considerable potential and generalization of the proposed method. Specifically, when equipped with TBD, RetinaNet with ResNet-50 achieves 41.0 mAP under the COCO benchmark, outperforming the recent FGD and FRS.
翻译:主流物体探测器通常由两个平行头头执行的两个子任务组成,包括分类和回归任务。这一典型的设计模式不可避免地导致分类评分和地方化质量之间的空间分布不一致。因此,本文件将减轻知识蒸馏方面的这种不协调现象。首先,我们观察到,大型教师比轻量级学生得到的和谐预测的比例更高。根据这一令人感兴趣的观察,设计了一个新的和谐评分(HS)来估计分类和回归质量的一致。HS模型模拟两个子任务之间的关系,并被视为促进学生和谐预测的先行知识。第二,这种空间错配将导致在蒸馏特性时出现不协调的区域选择。为了缓解这一问题,提出了一个新的任务错交的地貌蒸馏(TFD),通过灵活地平衡分类和回归任务的贡献。最终,HD和TFD是拟议的方法,称为TL-BS 蒸馏(TBD) 。广泛的实验表明,FADMM 具体地说,在REDM 和READMDM 之后,FADM 具体地显示FDDM 和REDMDM 和REDADDM 具体地表明,FADDMDM。