One-stage object detection is commonly implemented by optimizing two sub-tasks: object classification and localization, using heads with two parallel branches, which might lead to a certain level of spatial misalignment in predictions between the two tasks. In this work, we propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the two tasks in a learning-based manner. First, we design a novel Task-aligned Head (T-Head) which offers a better balance between learning task-interactive and task-specific features, as well as a greater flexibility to learn the alignment via a task-aligned predictor. Second, we propose Task Alignment Learning (TAL) to explicitly pull closer (or even unify) the optimal anchors for the two tasks during training via a designed sample assignment scheme and a task-aligned loss. Extensive experiments are conducted on MS-COCO, where TOOD achieves a 51.1 AP at single-model single-scale testing. This surpasses the recent one-stage detectors by a large margin, such as ATSS (47.7 AP), GFL (48.2 AP), and PAA (49.0 AP), with fewer parameters and FLOPs. Qualitative results also demonstrate the effectiveness of TOOD for better aligning the tasks of object classification and localization. Code is available at https://github.com/fcjian/TOOD.
翻译:通常通过优化两个子任务来实施一个阶段的物体探测:目标分类和定位,使用两个平行分支的负责人,这可能导致两个任务之间预测的空间差错,在这项工作中,我们提议采用一个与任务一致的单阶段物体探测(TOOD),以学习方式明确调整这两项任务。首先,我们设计一个新的与任务一致的负责人(T-Head),在学习任务互动和任务具体特点之间实现更好的平衡,以及更加灵活地通过一个任务匹配的预测器了解调整情况。第二,我们提议任务协调学习(TAL),在设计样本分配计划和任务调整损失过程中,明确拉近(甚至统一)两项任务的最佳锚定点。在MS-COCO(TOD)上进行了广泛的实验,在单一模型单一规模的测试中实现了51.1个AP(TH-F),这大大超过了最近的一阶段探测器,例如ATS(47.7 AP)、GLL(48.2 AP),以及PAAA(49),在培训期间,通过设计样本分配计划和任务调整任务分类中,将AP/ODQ(FOD/ODQ)的当地标准的参数和FOD/OD/OPOD 也减少了。