We present IMAS, a method that segments the primary objects in videos without manual annotation in either training or inference. Previous methods in unsupervised video object segmentation (UVOS) have demonstrated the effectiveness of motion as either input or supervision for segmentation. However, motion signals may be uninformative or even misleading in cases such as deformable objects and objects with reflections, leading to unsatisfactory segmentation. In contrast, IMAS achieves Improved UVOS with Motion-Appearance Synergy. Our method has two training stages: 1) a motion-supervised object discovery stage that handles motion-appearance conflicts through a learnable residual pathway; 2) a refinement stage with both low- and high-level appearance supervision to correct model misconceptions learned from misleading motion cues. Additionally, we propose motion-semantic alignment as a model-agnostic, annotation-free hyperparameter tuning method. We demonstrate its effectiveness in tuning critical hyperparameters previously tuned with human annotations or hand-crafted, hyperparameter-specific metrics. IMAS greatly improves the segmentation quality on several common UVOS benchmarks. For example, we surpass previous methods by 8.3% on the DAVIS16 benchmark with only a standard ResNet and convolutional heads. We intend to release our code for future research and applications.
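The first training stage mentions a learnable residual pathway that absorbs motion-appearance conflicts during motion-supervised object discovery. The snippet below is a minimal, hypothetical PyTorch sketch of how such a pathway could be wired, not the authors' released implementation; the module name `ResidualPathwayHead`, the feature dimension, and the two-branch layout are our assumptions. The idea illustrated is that the motion-based supervision is applied to the sum of the appearance-based mask logits and a residual term, while only the appearance-based mask is used as the segmentation output, so motion-specific discrepancies can be explained by the residual branch instead of corrupting the mask.

```python
import torch
import torch.nn as nn


class ResidualPathwayHead(nn.Module):
    """Hypothetical sketch of a segmentation head with a learnable residual pathway.

    - `mask_head` predicts the appearance-based object mask (used at inference).
    - `residual_head` predicts a residual term that, added to the mask logits,
      is what the motion-based loss supervises during training.
    """

    def __init__(self, feat_dim: int = 256):
        super().__init__()
        self.mask_head = nn.Conv2d(feat_dim, 1, kernel_size=1)
        self.residual_head = nn.Conv2d(feat_dim, 1, kernel_size=1)

    def forward(self, feats: torch.Tensor):
        mask_logits = self.mask_head(feats)            # segmentation output
        residual_logits = self.residual_head(feats)    # absorbs motion-appearance conflicts
        motion_supervised_logits = mask_logits + residual_logits
        return mask_logits, motion_supervised_logits


if __name__ == "__main__":
    # Toy usage: features from a backbone (batch 2, 256 channels, 32x32 grid).
    feats = torch.randn(2, 256, 32, 32)
    head = ResidualPathwayHead(feat_dim=256)
    mask_logits, motion_logits = head(feats)
    print(mask_logits.shape, motion_logits.shape)  # torch.Size([2, 1, 32, 32]) each
```

Under this sketch, a motion-reconstruction or flow-based loss would be computed on `motion_supervised_logits`, while evaluation uses `mask_logits` alone; the exact losses and backbone follow the paper, not this illustration.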