利用感光器主域随机化缩小粒子估计网络的现实差距 (Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based Domain Randomization) - 专知论文

会员服务 ·

0

估计/估计量 · Performer · 模型评估 · Networking · 训练数据 ·

2021 年 5 月 18 日

Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based Domain Randomization

翻译：利用感光器主域随机化缩小粒子估计网络的现实差距

Frederik Hagelskjaer,Anders Glent Buch

from arxiv, 10 pages, 5 figures, 5 tables

Since the introduction of modern deep learning methods for object pose estimation, test accuracy and efficiency has increased significantly. For training, however, large amounts of annotated training data are required for good performance. While the use of synthetic training data prevents the need for manual annotation, there is currently a large performance gap between methods trained on real and synthetic data. This paper introduces a new method, which bridges this gap. Most methods trained on synthetic data use 2D images, as domain randomization in 2D is more developed. To obtain precise poses, many of these methods perform a final refinement using 3D data. Our method integrates the 3D data into the network to increase the accuracy of the pose estimation. To allow for domain randomization in 3D, a sensor-based data augmentation has been developed. Additionally, we introduce the SparseEdge feature, which uses a wider search space during point cloud propagation to avoid relying on specific features without increasing run-time. Experiments on three large pose estimation benchmarks show that the presented method outperforms previous methods trained on synthetic data and achieves comparable results to existing methods trained on real data.

翻译：自采用现代深入的物体估计方法以来,测试准确性和效率有了显著提高。但是,对于培训来说,需要大量的附加说明的培训数据才能取得良好的业绩。使用合成培训数据可以避免人工说明的必要性,但目前实际数据和合成数据培训方法之间存在很大的性能差距。本文介绍了一种弥合这一差距的新方法。大多数关于合成数据的培训方法使用2D图象,因为2D中的域随机化比较发达。为了获得精确的成份,许多这些方法使用3D数据进行最后的改进。我们的方法将3D数据纳入网络,以提高组合估算的准确性。为了允许3D中的域随机化,还开发了基于传感器的数据增强功能。此外,我们引入了在点云传播期间使用更广泛的搜索空间以避免在不增加运行时间的情况下依赖特定特征的SprassEdge特征。对三种大型的估算基准的实验表明,所提出的方法比以前训练的合成数据方法要好,并取得与实际数据培训的现有方法相近的结果。

0

相关内容

估计/估计量

估计/估计量

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

专知会员服务

32+阅读 · 2020年4月26日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

专知会员服务

34+阅读 · 2020年2月27日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】CReaM: 使用卷积网络估计深度的浓缩实时模型

【泡泡一分钟】CReaM: 使用卷积网络估计深度的浓缩实时模型

泡泡机器人SLAM

5+阅读 · 2019年3月24日

【泡泡一分钟】多运动视觉里程计（MVO）：摄像机和第三方运动的同时估计

【泡泡一分钟】多运动视觉里程计（MVO）：摄像机和第三方运动的同时估计

泡泡机器人SLAM

3+阅读 · 2019年3月10日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

13+阅读 · 2019年1月16日

【泡泡一分钟】用于平面环境的线性RGBD-SLAM

【泡泡一分钟】用于平面环境的线性RGBD-SLAM

泡泡机器人SLAM

6+阅读 · 2018年12月18日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

(TensorFlow)实时语义分割比较研究

(TensorFlow)实时语义分割比较研究

机器学习研究会

9+阅读 · 2018年3月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Real-time Pose Estimation from Images for Multiple Humanoid Robots

Real-time Pose Estimation from Images for Multiple Humanoid Robots

Arxiv

0+阅读 · 2021年7月6日

AdaXpert: Adapting Neural Architecture for Growing Data

Arxiv

4+阅读 · 2021年7月1日

Monocular Real-time Full Body Capture with Inter-part Correlations

Monocular Real-time Full Body Capture with Inter-part Correlations

Arxiv

9+阅读 · 2020年12月11日

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Arxiv

6+阅读 · 2020年4月14日

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

Arxiv

7+阅读 · 2020年3月30日

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Arxiv

5+阅读 · 2019年3月8日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Domain Specific Approximation for Object Detection

Arxiv

5+阅读 · 2018年10月4日

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation

Arxiv

4+阅读 · 2018年3月18日

Complex-YOLO: Real-time 3D Object Detection on Point Clouds

Arxiv

3+阅读 · 2018年3月16日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

【CVPR2020】物体实例持续学习，Continual Learning of Object Instances

专知会员服务

32+阅读 · 2020年4月26日

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

【ICLR2020】用实对二进制卷积训练二进制神经网络，Training Binary Neural Networks with Real-to-Binary Convolutions

专知会员服务

26+阅读 · 2020年3月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

【ICLR2020】深度神经网络优化轨迹的平衡点，The Break-Even Point on Optimization Trajectories of Deep Neural Networks

专知会员服务

34+阅读 · 2020年2月27日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Deep Research（深度研究）：系统性综述

《革新战术战场空间能力：反无人机系统》报告

【普林斯顿博士论文】用于语音的生成式通用模型

螺旋式开发作为战略资产：美军启示

相关资讯

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】CReaM: 使用卷积网络估计深度的浓缩实时模型

【泡泡一分钟】CReaM: 使用卷积网络估计深度的浓缩实时模型

泡泡机器人SLAM

5+阅读 · 2019年3月24日

【泡泡一分钟】多运动视觉里程计（MVO）：摄像机和第三方运动的同时估计

【泡泡一分钟】多运动视觉里程计（MVO）：摄像机和第三方运动的同时估计

泡泡机器人SLAM

3+阅读 · 2019年3月10日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

【泡泡一分钟】LIMO：激光和单目相机融合的视觉里程计

泡泡机器人SLAM

13+阅读 · 2019年1月16日

【泡泡一分钟】用于平面环境的线性RGBD-SLAM

【泡泡一分钟】用于平面环境的线性RGBD-SLAM

泡泡机器人SLAM

6+阅读 · 2018年12月18日

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

【泡泡一分钟】SSD6D：基于RGB的三维检测和6自由度位姿估计(ICCV2017-159)

泡泡机器人SLAM

17+阅读 · 2018年10月12日

(TensorFlow)实时语义分割比较研究

(TensorFlow)实时语义分割比较研究

机器学习研究会

9+阅读 · 2018年3月12日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Real-time Pose Estimation from Images for Multiple Humanoid Robots

Real-time Pose Estimation from Images for Multiple Humanoid Robots

Arxiv

0+阅读 · 2021年7月6日

AdaXpert: Adapting Neural Architecture for Growing Data

Arxiv

4+阅读 · 2021年7月1日

Monocular Real-time Full Body Capture with Inter-part Correlations

Monocular Real-time Full Body Capture with Inter-part Correlations

Arxiv

9+阅读 · 2020年12月11日

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Arxiv

6+阅读 · 2020年4月14日

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation

Arxiv

7+阅读 · 2020年3月30日

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Learning to Estimate Pose and Shape of Hand-Held Objects from RGB Images

Arxiv

5+阅读 · 2019年3月8日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

Domain Specific Approximation for Object Detection

Arxiv

5+阅读 · 2018年10月4日

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation

Arxiv

4+阅读 · 2018年3月18日

Complex-YOLO: Real-time 3D Object Detection on Point Clouds

Arxiv

3+阅读 · 2018年3月16日

微信扫码咨询专知VIP会员