KFIOU 旋转对象探测损失 (The KFIoU Loss for Rotated Object Detection)

from arxiv, 17 pages, 6 figures, 7 tables, accepted by ICLR 2023, TensorFlow code: https://github.com/yangxue0827/RotationDetection, PyTorch code: https://github.com/open-mmlab/mmrotate, Jittor code: https://github.com/Jittor/JDet

Differing from the well-developed horizontal object detection area whereby the computing-friendly IoU based loss is readily adopted and well fits with the detection metrics. In contrast, rotation detectors often involve a more complicated loss based on SkewIoU which is unfriendly to gradient-based training. In this paper, we propose an effective approximate SkewIoU loss based on Gaussian modeing and Kalman filter, which mainly consists of two items. The first term is a scale-insensitive center point loss, which is used to quickly get the center points between bounding boxes closer to assist the second term. In the distance-independent second term, Kalman filter is adopted to inherently mimic the mechanism of SkewIoU by its definition, and show its alignment with the SkewIoU loss at trend-level within a certain distance (i.e. within 9 pixels). This is in contrast to recent Gaussian modeling based rotation detectors e.g. GWD loss and KLD loss that involve a human-specified distribution distance metric which require additional hyperparameter tuning that vary across datasets and detectors. The resulting new loss called KFIoU loss is easier to implement and works better compared with exact SkewIoU loss, thanks to its full differentiability and ability to handle the non-overlapping cases. We further extend our technique to the 3-D case which also suffers from the same issues as 2-D detection. Extensive results on various public datasets (2-D/3-D, aerial/text/face images) with different base detectors show the effectiveness of our approach.

翻译：与发达的横向物体探测区不同,前者是计算友好型的基于IOU的丢失很容易被采纳,而且与探测指标相适应。相比之下,轮换探测器往往涉及基于SkewIoU的复杂损失,而SkewIoU对基于梯度的培训不友好。在本文中,我们建议基于高斯模式和卡尔曼过滤器的KsewIoU损失具有有效的近似性,主要由两个项目组成。第一个术语是规模不敏感的中心点损失,用于迅速使捆绑框之间的中心点更接近协助第二个术语。在第二术语的距离上,Kalman过滤器往往在本质上模仿SkewIoU的机制,而SkewIoU的这一机制对梯度不友好。在一定的距离内(即9个像素里),我们提出了与SkewIOU在趋势上的损失的匹配度。这与最近高斯的基于模型的轮换探测器(例如GWD损失和KLD损失,这涉及额外的超值分布距离测量仪,这需要额外的超光度测量仪,在数据设置和探测器之间不易进行。导致KSKFI的全面损失,因此采用新的数据损失为不同的计算,新的数据记录,因此成本损失为不同的计算,因此,因此,新的损失要求采用新的数据记录是不同的计算,与SKFLFLFLFLFLLLLLLLLLLV的计算到不同的计算。