Recent attention in instance segmentation has focused on query-based models. Despite being non-maximum suppression (NMS)-free and end-to-end, the superiority of these models on high-accuracy real-time benchmarks has not been well demonstrated. In this paper, we show the strong potential of query-based models on efficient instance segmentation algorithm designs. We present FastInst, a simple, effective query-based framework for real-time instance segmentation. FastInst can execute at a real-time speed (i.e., 32.5 FPS) while yielding an AP of more than 40 (i.e., 40.5 AP) on COCO test-dev without bells and whistles. Specifically, FastInst follows the meta-architecture of recently introduced Mask2Former. Its key designs include instance activation-guided queries, dual-path update strategy, and ground truth mask-guided learning, which enable us to use lighter pixel decoders, fewer Transformer decoder layers, while achieving better performance. The experiments show that FastInst outperforms most state-of-the-art real-time counterparts, including strong fully convolutional baselines, in both speed and accuracy. Code can be found at https://github.com/junjiehe96/FastInst .
翻译:尽管这些模型在高精确度实时基准上具有优势,但并未很好地展示出这些模型在高精确度实时基准上的优越性。在本文中,我们展示了基于查询的模型在高效实例分解算法设计方面的巨大潜力。我们展示了快速Inst,一个简单、有效的基于查询的框架,用于实时实例分解。快速Inst能够以实时速度(即32.5 FPS)执行超过40(即40.5 AP)的COCO测试-dev(即40.5 AP)的AP,而没有钟声和哨声。具体地说,快速Inst遵循最近引入的Mask2 Former的元结构。其主要设计包括实例激活指导查询、双向更新战略和地面真相掩码指导学习,这使我们能够使用更轻的像素解码器,更少的变换器解码层,同时取得更好的业绩。实验显示,最快速的超模,最强的Conperforforstformatformations, imate-judeal-fathernal-deal-deal-deal-dealtimeal-deal suptraction.</s>