PatchDCT: Patch Refinement for High Quality Instance Segmentation - 专知论文

February 6, 2023

PatchDCT: Patch Refinement for High Quality Instance Segmentation

Qinrou Wen, Jirui Yang, Xue Yang, Kewei Liang

High-quality instance segmentation is of growing importance in computer vision. DCT-Mask generates high-resolution masks directly from compressed vectors, without any refinement. To further refine masks obtained from compressed vectors, we propose, for the first time, a multi-stage refinement framework based on compressed vectors. However, the vanilla combination does not bring significant gains, because changes in some elements of the DCT vector affect the prediction of the entire mask. We therefore propose a simple and novel method named PatchDCT, which separates the mask decoded from a DCT vector into several patches and refines each patch with a dedicated classifier and regressor. Specifically, the classifier distinguishes mixed patches from all patches and corrects previously mispredicted foreground and background patches. The regressor, in turn, predicts the DCT vectors of mixed patches, further refining segmentation quality at boundary locations. Experiments show that our method achieves 2.0%, 3.2%, 4.5% AP and 3.4%, 5.3%, 7.0% Boundary AP improvements over Mask-RCNN on COCO, LVIS, and Cityscapes, respectively. It also surpasses DCT-Mask by 0.7%, 1.1%, 1.3% AP and 0.9%, 1.7%, 4.2% Boundary AP on COCO, LVIS, and Cityscapes. In addition, the performance of PatchDCT is competitive with other state-of-the-art methods.
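
The two ideas in the abstract, that a global DCT vector couples every pixel of the mask while patch-wise DCT vectors keep changes local, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the 16x16 toy mask, the 8x8 patch size, and the choice to keep a 3x3 block of low-frequency coefficients are all assumptions made for illustration. The three-way patch label is computed directly from the pixels as a stand-in for the learned classifier, and the direct DCT encoding of mixed patches stands in for the learned regressor.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (rows are frequencies)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m

def dct2(x):
    """2-D DCT of a square array."""
    d = dct_matrix(x.shape[0])
    return d @ x @ d.T

def idct2(c):
    """Inverse of dct2 (the basis is orthonormal, so the inverse is the transpose)."""
    d = dct_matrix(c.shape[0])
    return d.T @ c @ d

def split_patches(mask, p):
    """Reshape an (H, W) mask into an (H/p, W/p, p, p) grid of patches."""
    h, w = mask.shape
    return mask.reshape(h // p, p, w // p, p).transpose(0, 2, 1, 3)

def merge_patches(patches):
    """Inverse of split_patches."""
    gh, gw, p, _ = patches.shape
    return patches.transpose(0, 2, 1, 3).reshape(gh * p, gw * p)

def patch_label(patch):
    """Three-way label from pixel content (hypothetical stand-in for the classifier)."""
    if patch.all():
        return "foreground"
    if not patch.any():
        return "background"
    return "mixed"

def refine(mask, p=8, keep=3):
    """Fill pure patches directly; decode mixed patches from a truncated patch DCT."""
    patches = split_patches(mask.astype(float), p)
    out = np.zeros_like(patches)
    labels = np.empty(patches.shape[:2], dtype=object)
    for r in range(patches.shape[0]):
        for c in range(patches.shape[1]):
            patch = patches[r, c]
            labels[r, c] = patch_label(patch > 0.5)
            if labels[r, c] == "foreground":
                out[r, c] = 1.0
            elif labels[r, c] == "mixed":
                coeffs = dct2(patch)
                coeffs[keep:, :] = 0.0   # assumed truncation: keep a low-frequency block
                coeffs[:, keep:] = 0.0
                out[r, c] = idct2(coeffs) > 0.5
    return merge_patches(out), labels

# Toy 16x16 mask: top-left patch is pure foreground, bottom-left patch is mixed.
mask = np.zeros((16, 16))
mask[0:8, 0:8] = 1.0
mask[8:12, 0:8] = 1.0

# Global DCT vector: perturb one coefficient and count the affected pixels.
g = dct2(mask)
g[1, 2] += 1.0
global_changed = int(np.count_nonzero(np.abs(idct2(g) - mask) > 1e-6))

# Patch DCT vector: perturb the same coefficient inside a single patch.
patches = split_patches(mask, 8).copy()
c = dct2(patches[1, 0])
c[1, 2] += 1.0
patches[1, 0] = idct2(c)
patch_changed = int(np.count_nonzero(np.abs(merge_patches(patches) - mask) > 1e-6))

refined, labels = refine(mask)
print(global_changed, patch_changed)
print(labels)
```

In this toy setting the perturbed global coefficient alters all 256 pixels, whereas the perturbed patch coefficient alters only the 64 pixels of its own patch, which is the locality that motivates refining patch by patch.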
