Cascaded architectures have brought significant performance improvement in object detection and instance segmentation. However, there are lingering issues regarding the disparity in the Intersection-over-Union (IoU) distribution of the samples between training and inference. This disparity can potentially exacerbate detection accuracy. This paper proposes an architecture referred to as Sample Consistency Network (SCNet) to ensure that the IoU distribution of the samples at training time is close to that at inference time. Furthermore, SCNet incorporates feature relay and utilizes global contextual information to further reinforce the reciprocal relationships among classifying, detecting, and segmenting sub-tasks. Extensive experiments on the standard COCO dataset reveal the effectiveness of the proposed method over multiple evaluation metrics, including box AP, mask AP, and inference speed. In particular, while running 38\% faster, the proposed SCNet improves the AP of the box and mask predictions by respectively 1.3 and 2.3 points compared to the strong Cascade Mask R-CNN baseline. Code is available at \url{https://github.com/thangvubk/SCNet}.
翻译:封装结构在物体探测和试样分割方面带来了显著的性能改进,然而,在培训与推断之间样本的跨交统合(IoU)分布差异方面仍然存在一些问题,这种差异可能加剧检测准确性,本文件提议了一个称为抽样一致性网络(SCNet)的结构,以确保在培训时间将样本的IoU分布与推断时间相近,此外,SCNet纳入了特征中继器,并利用全球背景信息,进一步加强分类、检测和分解子任务之间的互惠关系。关于标准COCOCO数据集的广泛实验揭示了拟议方法对多个评价指标的有效性,包括箱式AP、面具AP和推断速度。特别是,拟议的SCNet在速度加快了38 ⁇ 的同时,将框和遮罩预测分别改善了1.3和2.3个百分点,而堡垒的精确度则与坚固的Cascade Make R-CN基线相比。代码见https://github.com/thangvub/SCNet。