MonoGround: Detecting Monocular 3D Objects from the Ground - 专知 Papers

3D · Estimation/Estimator · LiDAR · Extensibility · Object Detection

June 15, 2022

MonoGround: Detecting Monocular 3D Objects from the Ground

Zequn Qin, Xi Li

From arXiv, CVPR 2022

Monocular 3D object detection has attracted great attention for its simplicity and low cost. Because the 2D-to-3D mapping in monocular imaging is ill-posed, monocular 3D object detection suffers from inaccurate depth estimation and therefore poor 3D detection results. To alleviate this problem, we propose introducing the ground plane as a prior in monocular 3D object detection. The ground plane prior serves as an additional geometric condition on the ill-posed mapping and as an extra source of depth estimation. In this way, we can obtain more accurate depth estimates from the ground. Meanwhile, to take full advantage of the ground plane prior, we propose a depth-align training strategy and a precise two-stage depth inference method tailored to it. Notably, the ground plane prior requires no extra data sources such as LiDAR, stereo images, or depth information. Extensive experiments on the KITTI benchmark show that our method achieves state-of-the-art results compared with other methods while maintaining a very fast speed. Our code and models are available at https://github.com/cfzd/MonoGround.
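To illustrate why a ground plane can act as a source of depth, here is a minimal sketch of the textbook pinhole-camera relation between an image row and the depth of a ground point. It is not the MonoGround implementation. The function name `ground_depth_from_row`, the flat-ground and zero-pitch assumptions, and the sample values for focal length, principal point, and camera height are all illustrative assumptions that do not come from the abstract.

```python
import numpy as np


def ground_depth_from_row(v, fy, cy, cam_height):
    """Depth in metres of flat-ground points imaged at pixel rows v.

    Assumptions (illustrative, not taken from the paper): a pinhole camera
    whose optical axis is parallel to a perfectly flat ground plane, with
    the image y axis pointing down. A ground point then sits cam_height
    below the camera, so v = fy * cam_height / z + cy, which gives
    z = fy * cam_height / (v - cy).
    """
    dv = np.atleast_1d(np.asarray(v, dtype=float)) - cy
    # Rows at or above the horizon (dv <= 0) never meet the ground plane.
    depth = np.full(dv.shape, np.inf)
    below_horizon = dv > 0
    depth[below_horizon] = fy * cam_height / dv[below_horizon]
    return depth


# Hypothetical intrinsics and camera height, chosen only for the example.
rows = [180.0, 240.0, 360.0]
print(ground_depth_from_row(rows, fy=720.0, cy=180.0, cam_height=1.65))
```

Under these assumptions, rows closer to the bottom of the image map to smaller depths, and the row at the principal point corresponds to the horizon. A real scene would also need the camera pitch and any ground slope to be accounted for.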
