创生反对立网络的语义分割效益 (Instance Semantic Segmentation Benefits from Generative Adversarial Networks)

In design of instance segmentation networks that reconstruct masks, segmentation is often taken as its literal definition -- assigning each pixel a label. This has led to thinking the problem as a template matching one with the goal of minimizing the loss between the reconstructed and the ground truth pixels. Rethinking reconstruction networks as a generator, we define the problem of predicting masks as a GANs game framework: A segmentation network generates the masks, and a discriminator network decides on the quality of the masks. To demonstrate this game, we show effective modifications on the general segmentation framework in Mask R-CNN. We find that playing the game in feature space is more effective than the pixel space leading to stable training between the discriminator and the generator, predicting object coordinates should be replaced by predicting contextual regions for objects, and overall the adversarial loss helps the performance and removes the need for any custom settings per different data domain. We test our framework in various domains and report on cellphone recycling, autonomous driving, large-scale object detection, and medical glands. We observe in general GANs yield masks that account for crispier boundaries, clutter, small objects, and details, being in domain of regular shapes or heterogeneous and coalescing shapes. Our code for reproducing the results is available publicly.

翻译：在设计重塑面罩的分解网络时,分解通常被当作其字面定义 -- -- 指派每个像素标签。这导致将问题视为一个模板,将一个模板与尽可能减少被重建者和地面真理像素之间损失的目标相匹配。重新思考重建网络作为生成者,我们把预测面罩的问题定义为GAN游戏框架: 分解网络产生面罩,而歧视者网络决定面罩的质量。为了展示这个游戏,我们展示了对Mas R-CNN中普通分解框架的有效修改。我们发现,在地貌空间中玩游戏比在像素空间里玩游戏比导致在导师和生成者之间进行稳定培训的平分解空间更有效,预测对象坐标应该被用预测对象的背景区域来取代,而总体的对抗性损失有助于性能,并消除对不同数据领域任何定制设置的需要。我们在不同领域测试我们的框架,并报告手机回收、自主驱动、大规模物体探测和医疗用地。我们发现,在一般的GANs生成的面罩比标比在导师和生成常规的磁度上, 我们的磁度、制成的磁度、制成的磁体的磁性、制成的磁性、制成的磁体、和制成。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

NeurIPS 2021丨K-Net: 迈向统一的图像分割

专知会员服务

17+阅读 · 2021年11月25日

【斯坦福&Facebook】生成式对抗变换器，Generative Adversarial Transformers

专知会员服务

21+阅读 · 2021年4月21日

【CVPR2020-CUHK】探索和利用GANs中的可解释语义，60页ppt，Exploring and Exploiting Interpretable Semantics in GANs

专知会员服务

13+阅读 · 2020年6月18日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日