This paper presents a new loss function for the prediction of oriented bounding boxes, named head-tail-loss. The loss function consists in minimizing the distance between the prediction and the annotation of two key points that are representing the annotation of the object. The first point is the center point and the second is the head of the object. However, for the second point, the minimum distance between the prediction and either the head or tail of the groundtruth is used. On this way, either prediction is valid (with the head pointing to the tail or the tail pointing to the head). At the end the importance is to detect the direction of the object but not its heading. The new loss function has been evaluated on the DOTA and HRSC2016 datasets and has shown potential for elongated objects such as ships and also for other types of objects with different shapes.
翻译:本文提出了一种用于预测方向性边界框的新损失函数,称为Head-tail Loss。该损失函数旨在最小化预测和关键点注释之间的距离,这两个关键点代表着物体的注释。第一个关键点是中心点,第二个关键点是物体的头部。然而,对于第二个关键点,使用预测与地面实况的头部或尾部之间的最小距离。这样,任何一个预测都有效(头指向尾部或尾部指向头部)。最终重要的是检测对象的方向而不是它的朝向。新的损失函数已经在DOTA和HRSC2016数据集上进行了评估,并且在船舶等伸长物体以及其他不同形状的物体方面表现出了潜力。