3D自动驾驶中的占据估计: 一个简单的尝试 (A Simple Attempt for 3D Occupancy Estimation in Autonomous Driving) - 专知论文

会员服务 ·

0

深度估计 · 3D · 自动驾驶 · 驾驶环境 · 网络设计 ·

2023 年 4 月 4 日

A Simple Attempt for 3D Occupancy Estimation in Autonomous Driving

翻译：3D自动驾驶中的占据估计: 一个简单的尝试

Wanshui Gan,Ningkai Mo,Hongbin Xu,Naoto Yokoya

The task of estimating 3D occupancy from surrounding view images is an exciting development in the field of autonomous driving, following the success of Birds Eye View (BEV) perception.This task provides crucial 3D attributes of the driving environment, enhancing the overall understanding and perception of the surrounding space. However, there is still a lack of a baseline to define the task, such as network design, optimization, and evaluation. In this work, we present a simple attempt for 3D occupancy estimation, which is a CNN-based framework designed to reveal several key factors for 3D occupancy estimation. In addition, we explore the relationship between 3D occupancy estimation and other related tasks, such as monocular depth estimation, stereo matching, and BEV perception (3D object detection and map segmentation), which could advance the study on 3D occupancy estimation. For evaluation, we propose a simple sampling strategy to define the metric for occupancy evaluation, which is flexible for current public datasets. Moreover, we establish a new benchmark in terms of the depth estimation metric, where we compare our proposed method with monocular depth estimation methods on the DDAD and Nuscenes datasets.The relevant code will be available in https://github.com/GANWANSHUI/SimpleOccupancy

翻译：通过周围视图图像估计三维占据是自动驾驶领域的一个令人兴奋的发展，其紧随鸟瞰图（BEV）感知的成功。这项任务提供了关键的驾驶环境三维属性，增强了对周围空间的整体理解和感知。然而，仍然缺乏定义该任务的基线，例如网络设计、优化和评估。本文提出了一个简单的3D占据估计方法，这是一个基于卷积神经网络的框架，旨在揭示3D占据估计的几个关键因素。此外，我们探讨了3D占据估计与其他相关任务（如单眼深度估计、立体匹配和BEV感知（三维物体检测和地图分割））之间的关系，这可能推动3D占据估计的研究。对于评估，我们提出了一种简单的采样策略来定义占据评估指标，这对于当前公共数据集是灵活的。此外，我们在DDAD和Nuscenes数据集上建立了新的基准，以深度估计指标为基础，其中我们将我们提出的方法与单眼深度估计方法进行了比较。相关的代码将在https://github.com/GANWANSHUI/SimpleOccupancy上提供。

0

相关内容

深度估计

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

【吉林大学等】三维人体运动预测研究综述，3D Human Motion Prediction : A Survey

【吉林大学等】三维人体运动预测研究综述，3D Human Motion Prediction : A Survey

专知会员服务

29+阅读 · 2022年3月8日

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

专知会员服务

26+阅读 · 2020年7月4日

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

专知会员服务

14+阅读 · 2020年6月18日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

专知会员服务

27+阅读 · 2020年1月17日

自动驾驶汽车的计算机视觉全面综述论文：问题、数据集和现状，附283页PDF下载

自动驾驶汽车的计算机视觉全面综述论文：问题、数据集和现状，附283页PDF下载

专知会员服务

113+阅读 · 2019年12月20日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

BEVFormer：基于Transformer的自动驾驶BEV纯视觉感知

BEVFormer：基于Transformer的自动驾驶BEV纯视觉感知

PaperWeekly

1+阅读 · 2022年6月21日

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

计算机视觉life

41+阅读 · 2019年7月22日

【泡泡一分钟】单目视觉惯性SLAM的重定位，全局优化和地图融合

【泡泡一分钟】单目视觉惯性SLAM的重定位，全局优化和地图融合

泡泡机器人SLAM

59+阅读 · 2019年7月15日

CVPR2019 | 15篇论文速递（涵盖目标检测、语义分割和姿态估计等方向）

CVPR2019 | 15篇论文速递（涵盖目标检测、语义分割和姿态估计等方向）

AI研习社

15+阅读 · 2019年5月8日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡图灵智库】VINS-Mono：一种鲁棒多功能的单目视觉惯性状态估计器

【泡泡图灵智库】VINS-Mono：一种鲁棒多功能的单目视觉惯性状态估计器

泡泡机器人SLAM

19+阅读 · 2018年12月23日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

泡泡机器人SLAM

13+阅读 · 2018年3月23日

基于车载激光点云的城市道路三维精细重建

国家自然科学基金

0+阅读 · 2015年12月31日

基于三维点云无网格处理的大型复杂锻件结构特征曲线重建

国家自然科学基金

0+阅读 · 2013年12月31日

图像增强下的ACL三维重建研究

国家自然科学基金

0+阅读 · 2013年12月31日

一类新的芬斯勒度量的曲率性质

国家自然科学基金

1+阅读 · 2013年12月31日

无线传感器网络中功率受限的分布式矢量估计

国家自然科学基金

0+阅读 · 2013年12月31日

基于低共熔溶剂的稀土表面合金燃料电池催化剂的制备，性能及第一性原理计算研究

国家自然科学基金

0+阅读 · 2012年12月31日

指标定理、椭圆亏格、非交换留数和热核

国家自然科学基金

0+阅读 · 2012年12月31日

铜族与稀土二元金属氧化物团簇低温催化氧化一氧化碳的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于局部仿射不变特征的移动机器人单目vSLAM研究

国家自然科学基金

0+阅读 · 2009年12月31日

氧化物表面氧缺陷和表面羟基在多相催化体系中作用的模型体系研究

国家自然科学基金

0+阅读 · 2008年12月31日

Complexity measure, kernel density estimation, bandwidth selection, and the efficient market hypothesis

Arxiv

0+阅读 · 2023年5月22日

uCTRL: Unbiased Contrastive Representation Learning via Alignment and Uniformity for Collaborative Filtering

Arxiv

0+阅读 · 2023年5月22日

Deep Radar Inverse Sensor Models for Dynamic Occupancy Grid Maps

Arxiv

0+阅读 · 2023年5月21日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education

Arxiv

0+阅读 · 2023年5月20日

Video Killed the HD-Map: Predicting Driving Behavior Directly From Drone Images

Arxiv

0+阅读 · 2023年5月19日

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Arxiv

17+阅读 · 2022年5月10日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

3D Object Detection for Autonomous Driving: A Survey

Arxiv

12+阅读 · 2021年6月21日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

【CVPR2022】自动驾驶中的伪双目三维目标检测，Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving

专知会员服务

18+阅读 · 2022年3月19日

【吉林大学等】三维人体运动预测研究综述，3D Human Motion Prediction : A Survey

【吉林大学等】三维人体运动预测研究综述，3D Human Motion Prediction : A Survey

专知会员服务

29+阅读 · 2022年3月8日

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

【MIT】从视频物理系统进行因果发现，Causal Discovery in Physical Systems from Videos

专知会员服务

26+阅读 · 2020年7月4日

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，56页ppt，Neural Topological SLAM for Visual Navigation

专知会员服务

14+阅读 · 2020年6月18日

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

【CVPR2020】视觉导航的神经拓扑SLAM，Neural Topological SLAM for Visual Navigation

专知会员服务

52+阅读 · 2020年5月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

运动物体检测与运动相机:一个全面的综述：Moving Objects Detection with a Moving Camera: A Comprehensive Review

专知会员服务

27+阅读 · 2020年1月17日

自动驾驶汽车的计算机视觉全面综述论文：问题、数据集和现状，附283页PDF下载

自动驾驶汽车的计算机视觉全面综述论文：问题、数据集和现状，附283页PDF下载

专知会员服务

113+阅读 · 2019年12月20日

【深度估计| 2019最新综述】单目深度估计方法综述（Monocular Depth Estimation: A Survey）

专知会员服务

69+阅读 · 2019年11月23日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

BEVFormer：基于Transformer的自动驾驶BEV纯视觉感知

BEVFormer：基于Transformer的自动驾驶BEV纯视觉感知

PaperWeekly

1+阅读 · 2022年6月21日

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

ICRA 2019 论文速览 | 基于Deep Learning 的SLAM

计算机视觉life

41+阅读 · 2019年7月22日

【泡泡一分钟】单目视觉惯性SLAM的重定位，全局优化和地图融合

【泡泡一分钟】单目视觉惯性SLAM的重定位，全局优化和地图融合

泡泡机器人SLAM

59+阅读 · 2019年7月15日

CVPR2019 | 15篇论文速递（涵盖目标检测、语义分割和姿态估计等方向）

CVPR2019 | 15篇论文速递（涵盖目标检测、语义分割和姿态估计等方向）

AI研习社

15+阅读 · 2019年5月8日

CVPR2019 | Stereo R-CNN 3D 目标检测

CVPR2019 | Stereo R-CNN 3D 目标检测

极市平台

27+阅读 · 2019年3月10日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡图灵智库】VINS-Mono：一种鲁棒多功能的单目视觉惯性状态估计器

【泡泡图灵智库】VINS-Mono：一种鲁棒多功能的单目视觉惯性状态估计器

泡泡机器人SLAM

19+阅读 · 2018年12月23日

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

【泡泡一分钟】用于RGBD语义分割的三维图神经网络(ICCV2017-546)

泡泡机器人SLAM

22+阅读 · 2018年12月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

【泡泡一分钟】将3D全卷积网络应用于车辆激光点云处理（IROS-11）

泡泡机器人SLAM

13+阅读 · 2018年3月23日

相关论文

Complexity measure, kernel density estimation, bandwidth selection, and the efficient market hypothesis

Arxiv

0+阅读 · 2023年5月22日

uCTRL: Unbiased Contrastive Representation Learning via Alignment and Uniformity for Collaborative Filtering

Arxiv

0+阅读 · 2023年5月22日

Deep Radar Inverse Sensor Models for Dynamic Occupancy Grid Maps

Arxiv

0+阅读 · 2023年5月21日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

AutoDRIVE: A Comprehensive, Flexible and Integrated Digital Twin Ecosystem for Enhancing Autonomous Driving Research and Education

Arxiv

0+阅读 · 2023年5月20日

Video Killed the HD-Map: Predicting Driving Behavior Directly From Drone Images

Arxiv

0+阅读 · 2023年5月19日

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey

Arxiv

17+阅读 · 2022年5月10日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

3D Object Detection for Autonomous Driving: A Survey

Arxiv

12+阅读 · 2021年6月21日

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark

Arxiv

19+阅读 · 2020年12月17日

相关基金

基于车载激光点云的城市道路三维精细重建

国家自然科学基金

0+阅读 · 2015年12月31日

基于三维点云无网格处理的大型复杂锻件结构特征曲线重建

国家自然科学基金

0+阅读 · 2013年12月31日

图像增强下的ACL三维重建研究

国家自然科学基金

0+阅读 · 2013年12月31日

一类新的芬斯勒度量的曲率性质

国家自然科学基金

1+阅读 · 2013年12月31日

无线传感器网络中功率受限的分布式矢量估计

国家自然科学基金

0+阅读 · 2013年12月31日

基于低共熔溶剂的稀土表面合金燃料电池催化剂的制备，性能及第一性原理计算研究

国家自然科学基金

0+阅读 · 2012年12月31日

指标定理、椭圆亏格、非交换留数和热核

国家自然科学基金

0+阅读 · 2012年12月31日

铜族与稀土二元金属氧化物团簇低温催化氧化一氧化碳的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于局部仿射不变特征的移动机器人单目vSLAM研究

国家自然科学基金

0+阅读 · 2009年12月31日

氧化物表面氧缺陷和表面羟基在多相催化体系中作用的模型体系研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员