Instance segmentation is one of the fundamental vision tasks. Recently, fully convolutional instance segmentation methods have drawn much attention as they are often simpler and more efficient than two-stage approaches like Mask R-CNN. To date, almost all such approaches fall behind the two-stage Mask R-CNN method in mask precision when models have similar computation complexity, leaving great room for improvement. In this work, we achieve improved mask prediction by effectively combining instance-level information with semantic information with lower-level fine-granularity. Our main contribution is a blender module which draws inspiration from both top-down and bottom-up instance segmentation approaches. The proposed BlendMask can effectively predict dense per-pixel position-sensitive instance features with very few channels, and learn attention maps for each instance with merely one convolution layer, thus being fast in inference. BlendMask can be easily incorporated with the state-of-the-art one-stage detection frameworks and outperforms Mask R-CNN under the same training schedule while being 20% faster. A light-weight version of BlendMask achieves $ 34.2% $ mAP at 25 FPS evaluated on a single 1080Ti GPU card. Because of its simplicity and efficacy, we hope that our BlendMask could serve as a simple yet strong baseline for a wide range of instance-wise prediction tasks. Code is available at https://git.io/AdelaiDet
翻译:目前,几乎所有这类方法都落后于两阶段的Mask R-CNN 方法,当模型的计算复杂度相似时,在掩码精确度方面几乎都落后于两阶段的Mask R-CNN 方法,从而留下很大的改进空间。在这项工作中,我们通过有效地将实例级信息与语义信息与较低级微调级微调性信息相结合,实现改进遮罩预测。我们的主要贡献是一个混合器模块,它从上至下和下至上两个阶段的分解方法中汲取灵感。拟议的BlendMask 能够以很少的渠道有效地预测密度的每个像素位置敏感实例特征,并学习每种情况下仅使用一个变异层的注意图,从而可以很快地进行改进。BlendMask 能够很容易地与州级一级一级检测框架相结合,在相同的培训时间表下超越了Make R-CNN,同时速度更快。BlenMask 的轻度版本Blenal-Madeal-laimal 将达到我们GPLA的25% 的S-Syal Syal Seral Syal Serview a supal a sal a sal a supliflivieward sal a laviewal a laveal a sal a sal sal sal a laveal a laviewal a s s s lapal a latal lad lad s sal a sal a sal sal sal a sal a sal a sal sal latal sal laveal a lad sal lad sal lad sal sal lad sal lad sal lad sal sal lad sal lad sal sal lad sal lad sal sal lad sal lad lad lad ladal ladal ladal lad lad lad lad lad lad lad lad lad lad sal lad sal a sal lad ladal a sal sal a sal a sal a sal a sal la