In this paper, we present a conceptually simple, strong, and efficient framework for fully- and weakly-supervised panoptic segmentation, called Panoptic FCN. Our approach aims to represent and predict foreground things and background stuff in a unified fully convolutional pipeline, which can be optimized with point-based fully or weak supervision. In particular, Panoptic FCN encodes each object instance or stuff category with the proposed kernel generator and produces the prediction by convolving the high-resolution feature directly. With this approach, instance-aware and semantically consistent properties for things and stuff can be respectively satisfied in a simple generate-kernel-then-segment workflow. Without extra boxes for localization or instance separation, the proposed approach outperforms the previous box-based and -free models with high efficiency. Furthermore, we propose a new form of point-based annotation for weakly-supervised panoptic segmentation. It only needs several random points for both things and stuff, which dramatically reduces the annotation cost of human. The proposed Panoptic FCN is also proved to have much superior performance in this weakly-supervised setting, which achieves 82% of the fully-supervised performance with only 20 randomly annotated points per instance. Extensive experiments demonstrate the effectiveness and efficiency of Panoptic FCN on COCO, VOC 2012, Cityscapes, and Mapillary Vistas datasets. And it sets up a new leading benchmark for both fully- and weakly-supervised panoptic segmentation. Our code and models are made publicly available at https://github.com/dvlab-research/PanopticFCN.
翻译:在本文中,我们提出了一个完整和薄弱监督的全光学截面结构概念简单、强大和高效的框架,称为Panpopt FCN。我们的方法旨在代表并预测在统一的全革命管道中出现的表面事情和背景材料,这种管道可以通过点基完全或薄弱的监督加以优化。特别是,Panpt FCN将每个物体实例或材料类别与拟议的内核生成器进行编码,并通过直接涉及高分辨率特性来作出预测。通过这种方法,在简单的生成内核内存薄弱的封闭性工作流程中,可以分别满足对事物和事物及材料的感官-直线性一致性特性。在没有额外的本地化或实例分离框的情况下,拟议的方法将超越先前的基于箱型和无源模型,且具有高效率。此外,我们提出了一种基于点的注释的新形式,即以低超强的光谱生成的光谱分解。对于高分辨率的事物和材料来说,只要有几处随机点即可大幅降低人类的直观成本。拟议的Panphic FCN 也证明它以甚优的超前一等标准,在2012年的运行前20个城市的常规测试中,在高端端/直观测试中也展示了甚高端运行。