This paper presents Mask-aware Intersection-over-Union (maIoU) for assigning anchor boxes as positives and negatives during training of instance segmentation methods. Unlike conventional IoU or its variants, which only considers the proximity of two boxes; maIoU consistently measures the proximity of an anchor box with not only a ground truth box but also its associated ground truth mask. Thus, additionally considering the mask, which, in fact, represents the shape of the object, maIoU enables a more accurate supervision during training. We present the effectiveness of maIoU on a state-of-the-art (SOTA) assigner, ATSS, by replacing IoU operation by our maIoU and training YOLACT, a SOTA real-time instance segmentation method. Using ATSS with maIoU consistently outperforms (i) ATSS with IoU by $\sim 1$ mask AP, (ii) baseline YOLACT with fixed IoU threshold assigner by $\sim 2$ mask AP over different image sizes and (iii) decreases the inference time by $25 \%$ owing to using less anchors. Then, exploiting this efficiency, we devise maYOLACT, a faster and $+6$ AP more accurate detector than YOLACT. Our best model achieves $37.7$ mask AP at $25$ fps on COCO test-dev establishing a new state-of-the-art for real-time instance segmentation. Code is available at https://github.com/kemaloksuz/Mask-aware-IoU
翻译:本文展示了MaIOU, 用于将锚箱作为正值和负值在模拟分解方法培训中指定锚箱作为正值和负值。 与传统的IOU或其变体不同, 传统IOU或其变体仅考虑两个箱的距离; MAIOU一贯测量一个锚箱的距离, 不仅使用地面真相箱, 而且还使用相关的地面真相面具。 因此, 此外, 将遮罩作为目标的形状, MAIOU 使得在培训期间能够进行更准确的监督。 我们展示了MAIOU对一个状态艺术( SOTA) 指派人( ASS) 的效力, 替代IOU 或它的变异体, 仅考虑两个盒子的距离; MAIOU 的实时分解法方法, 不仅使用地面真相盒, 而且还用它来测量一个固定的IUU值, (二) 基准YOL ACT, 使用美元 固定的IUUMACT, 用美元 来测量更精确的 。