Monocular 3D object detection is of great significance for autonomous driving but remains challenging. The core challenge is to predict the distance of objects in the absence of explicit depth information. Unlike most existing methods, which regress the distance as a single variable, we propose a novel geometry-based distance decomposition that recovers the distance from its factors. The decomposition factors the distance of an object into its most representative and stable variables, i.e., the physical height and the projected visual height in the image plane. Moreover, the decomposition maintains the self-consistency between the two heights, leading to robust distance prediction even when both predicted heights are inaccurate. The decomposition also enables us to trace the causes of distance uncertainty in different scenarios. Such decomposition makes the distance prediction interpretable, accurate, and robust. Our method directly predicts 3D bounding boxes from RGB images with a compact architecture, making training and inference simple and efficient. The experimental results show that our method achieves state-of-the-art performance on the monocular 3D Object Detection and Bird's Eye View tasks of the KITTI dataset, and can generalize to images with different camera intrinsics.
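The core geometric relation behind such a decomposition can be sketched under a standard pinhole camera model, where distance is recovered from the focal length, the physical height, and the projected visual height. This is an illustrative assumption, not the paper's exact formulation; the function name and units are hypothetical:

```python
def decompose_distance(focal_px: float, physical_height_m: float,
                       projected_height_px: float) -> float:
    """Recover object distance from its height factors (pinhole model):
    Z = f * H / h, with f in pixels, H in meters, h in pixels."""
    return focal_px * physical_height_m / projected_height_px

# Example: f = 700 px, a 1.5 m-tall car projecting to 50 px in the image
# lies roughly 21 m from the camera.
print(decompose_distance(700.0, 1.5, 50.0))  # → 21.0
```

Because distance depends on the ratio H / h, consistent errors in the two predicted heights partially cancel, which is one intuition for why enforcing self-consistency between them yields robust distance estimates.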