Dense depth and pose estimation is a vital prerequisite for various video applications. Traditional solutions suffer from the limited robustness of sparse feature tracking and from insufficient camera baselines in videos. Recent methods therefore leverage learning-based optical flow and depth priors to estimate dense depth. However, previous works either require heavy computation time or yield sub-optimal depth results. In this paper, we present GCVD, a globally consistent method for learning-based video structure from motion (SfM). GCVD integrates a compact pose graph into the CNN-based optimization to achieve globally consistent estimation via an effective keyframe selection mechanism. It improves the robustness of learning-based methods through flow-guided keyframes and well-established depth priors. Experimental results show that GCVD outperforms state-of-the-art methods on both depth and pose estimation. Furthermore, runtime experiments reveal that it is efficient on both short and long videos while providing global consistency.
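To make the flow-guided keyframe idea concrete, the minimal sketch below selects a keyframe whenever the optical-flow motion accumulated since the last keyframe passes a threshold. This is only an illustrative assumption: the function name `select_keyframes`, the accumulation rule, and the `flow_threshold` parameter are hypothetical and are not the paper's exact selection criterion.

```python
import numpy as np

def select_keyframes(mean_flow_mags, flow_threshold=8.0):
    """Pick keyframes whenever the motion accumulated since the last
    keyframe exceeds a threshold (hypothetical criterion; GCVD's actual
    flow-guided rule may differ).

    mean_flow_mags[i] is the mean optical-flow magnitude (in pixels)
    between frame i-1 and frame i, e.g. from a learned flow network.
    """
    keyframes = [0]          # always keep the first frame
    accumulated = 0.0
    for i in range(1, len(mean_flow_mags)):
        accumulated += mean_flow_mags[i]
        if accumulated >= flow_threshold:
            keyframes.append(i)  # enough parallax: promote to keyframe
            accumulated = 0.0
    return keyframes

# Toy usage with synthetic per-frame flow magnitudes.
rng = np.random.default_rng(0)
mags = rng.uniform(0.5, 3.0, size=60)
print(select_keyframes(mags))
```

Thresholding on accumulated flow is one simple way to guarantee sufficient camera baseline between consecutive keyframes, which is the failure mode of traditional SfM on casual videos that the abstract highlights.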