LVIS 挑战轨道技术报告第一地点解决办法:大型词汇区划分配平衡和边界改善 (LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation) - 专知论文

会员服务 ·

0

掩码 · 词表 · 示例 · 早停 · 损失函数（机器学习） ·

2021 年 11 月 5 日

LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation

翻译：LVIS 挑战轨道技术报告第一地点解决办法:大型词汇区划分配平衡和边界改善

WeiFu Fu,CongChong Nie,Ting Sun,Jun Liu,TianLiang Zhang,Yong Liu

This report introduces the technical details of the team FuXi-Fresher for LVIS Challenge 2021. Our method focuses on the problem in following two aspects: the long-tail distribution and the segmentation quality of mask and boundary. Based on the advanced HTC instance segmentation algorithm, we connect transformer backbone(Swin-L) through composite connections inspired by CBNetv2 to enhance the baseline results. To alleviate the problem of long-tail distribution, we design a Distribution Balanced method which includes dataset balanced and loss function balaced modules. Further, we use a Mask and Boundary Refinement method composed with mask scoring and refine-mask algorithms to improve the segmentation quality. In addition, we are pleasantly surprised to find that early stopping combined with EMA method can achieve a great improvement. Finally, by using multi-scale testing and increasing the upper limit of the number of objects detected per image, we achieved more than 45.4% boundary AP on the val set of LVIS Challenge 2021. On the test data of LVIS Challenge 2021, we rank 1st and achieve 48.1% AP. Notably, our APr 47.5% is very closed to the APf 48.0%.

翻译：本报告介绍FuXi-FresherLVIS 挑战2021小组的技术细节。我们的方法侧重于以下两个方面的问题:蒙面和边界的长尾分布和分解质量。根据先进的HTC例分化算法,我们通过CBNetv2的复合连接连接变压器主干网(Swin-L)以加强基线结果。为了减轻长尾分布问题,我们设计了一种分配平衡平衡法,其中包括数据集平衡和损耗功能模块。此外,我们用蒙面和边界精细微算法构成的面罩和边界精细化方法来提高分解质量。此外,我们很惊讶地发现,与EMA方法的早期停止结合能够取得很大的改进。最后,通过多尺度测试和增加每个图像所检测对象的上限,我们在LVIS 挑战2021年的val集中实现了超过45.4%的边界AP。关于LVIS挑战2021的测试数据,我们排在1级和48.1%的AP.0至48.1%的APr%已经关闭。

0

相关内容

零样本图像分类综述

专知会员服务

52+阅读 · 2021年5月15日

图像分割方法综述

图像分割方法综述

专知会员服务

56+阅读 · 2020年11月22日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

专知会员服务

63+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【图机器学习论文】综述：图嵌入技术、应用和性能（Graph Embedding Techniques, Applications, and Performance: A Survey）

【图机器学习论文】综述：图嵌入技术、应用和性能（Graph Embedding Techniques, Applications, and Performance: A Survey）

专知会员服务

73+阅读 · 2019年12月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Technical Report: Bundling Linked Data Structures for Linearizable Range Queries

Arxiv

0+阅读 · 2022年1月3日

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation

Arxiv

9+阅读 · 2021年12月3日

SOLQ: Segmenting Objects by Learning Queries

Arxiv

8+阅读 · 2021年9月30日

Joint Inductive and Transductive Learning for Video Object Segmentation

Arxiv

5+阅读 · 2021年8月8日

Efficient Iterative Amortized Inference for Learning Symmetric and Disentangled Multi-Object Representations

Arxiv

5+阅读 · 2021年6月7日

S4Net: Single Stage Salient-Instance Segmentation

S4Net: Single Stage Salient-Instance Segmentation

Arxiv

10+阅读 · 2019年4月10日

Automatically Designing CNN Architectures for Medical Image Segmentation

Automatically Designing CNN Architectures for Medical Image Segmentation

Arxiv

10+阅读 · 2018年7月19日

Object detection at 200 Frames Per Second

Arxiv

5+阅读 · 2018年5月16日

On the iterative refinement of densely connected representation levels for semantic segmentation

Arxiv

6+阅读 · 2018年4月30日

Path Aggregation Network for Instance Segmentation

Arxiv

3+阅读 · 2018年3月5日

VIP会员

文章信息

相关主题

损失函数（机器学习）

相关VIP内容

零样本图像分类综述

专知会员服务

52+阅读 · 2021年5月15日

图像分割方法综述

图像分割方法综述

专知会员服务

56+阅读 · 2020年11月22日

【视频目标检测与跟踪：综述论文】Video Object Segmentation and Tracking: A Survey

专知会员服务

66+阅读 · 2020年6月4日

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

【综述】生成式对抗网络(GANs)最新2020综述:挑战、解决方案和未来方向，Generative Adversarial Networks (GANs): Challenges, Solutions, and Future Directions

专知会员服务

63+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【图机器学习论文】综述：图嵌入技术、应用和性能（Graph Embedding Techniques, Applications, and Performance: A Survey）

【图机器学习论文】综述：图嵌入技术、应用和性能（Graph Embedding Techniques, Applications, and Performance: A Survey）

专知会员服务

73+阅读 · 2019年12月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Technical Report: Bundling Linked Data Structures for Linearizable Range Queries

Arxiv

0+阅读 · 2022年1月3日

Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation

Arxiv

9+阅读 · 2021年12月3日

SOLQ: Segmenting Objects by Learning Queries

Arxiv

8+阅读 · 2021年9月30日

Joint Inductive and Transductive Learning for Video Object Segmentation

Arxiv

5+阅读 · 2021年8月8日

Efficient Iterative Amortized Inference for Learning Symmetric and Disentangled Multi-Object Representations

Arxiv

5+阅读 · 2021年6月7日

S4Net: Single Stage Salient-Instance Segmentation

S4Net: Single Stage Salient-Instance Segmentation

Arxiv

10+阅读 · 2019年4月10日

Automatically Designing CNN Architectures for Medical Image Segmentation

Automatically Designing CNN Architectures for Medical Image Segmentation

Arxiv

10+阅读 · 2018年7月19日

Object detection at 200 Frames Per Second

Arxiv

5+阅读 · 2018年5月16日

On the iterative refinement of densely connected representation levels for semantic segmentation

Arxiv

6+阅读 · 2018年4月30日

Path Aggregation Network for Instance Segmentation

Arxiv

3+阅读 · 2018年3月5日

微信扫码咨询专知VIP会员