Differing from the well-developed horizontal object detection area whereby the computing-friendly IoU based loss is readily adopted and well fits with the detection metrics. In contrast, rotation detectors often involve a more complicated loss based on SkewIoU which is unfriendly to gradient-based training. In this paper, we argue that one effective alternative is to devise an approximate loss who can achieve trend-level alignment with SkewIoU loss instead of the strict value-level identity. Specifically, we model the objects as Gaussian distribution and adopt Kalman filter to inherently mimic the mechanism of SkewIoU by its definition, and show its alignment with the SkewIoU at trend-level. This is in contrast to recent Gaussian modeling based rotation detectors e.g. GWD, KLD that involves a human-specified distribution distance metric which requires additional hyperparameter tuning. The resulting new loss called KFIoU is easier to implement and works better compared with exact SkewIoU, thanks to its full differentiability and ability to handle the non-overlapping cases. We further extend our technique to the 3-D case which also suffers from the same issues as 2-D detection. Extensive results on various public datasets (2-D/3-D, aerial/text/face images) with different base detectors show the effectiveness of our approach.
翻译:与开发良好的横向物体探测领域不同,后者使计算机友好型IOU基于IOU的损失很容易被采纳,而且与探测指标相适应。相比之下,轮换探测器往往涉及基于SkewIoU的更为复杂的损失,而SkewIoU对基于梯度的培训是不友好的。在本文中,我们认为,一个有效的替代办法是设计一种大致的损失,这种损失可以与SkeewIoU的损失实现趋势水平一致,而不是严格的价值水平特性。具体地说,我们用Gaussian分发和Kalman过滤器模型来模拟SkewIoU的内在机制,并显示其与SkeewIoU在趋势层面的一致性。这与最近高斯建模型的基于旋转探测器(例如GWD,KLD, KLD, 涉及人指定分布距离指标,这需要额外的超参数调整。由此产生的新的损失称为KFIOU,较容易执行和工作与精确的SkewIoU相比,因为其完全可及处理不重叠案件的能力不同。我们的技术与SkeewIoU在趋势上与Scal-D发现3-D结果也与3号不同。我们对3-D的深度探测结果进行了不同的调查。