Accurate 7DoF pose prediction of vehicles at an intersection is an important task for assessing potential conflicts between road users. In principle, this could be achieved by a single camera system capable of detecting the pose of each vehicle, but this would require a large, accurately labelled dataset from which to train the detector. Although large vehicle pose datasets exist (ostensibly developed for autonomous vehicles), we find training on these datasets inadequate. These datasets contain images from a ground-level viewpoint, whereas an ideal view for intersection observation would be elevated above the road surface. We develop an alternative approach using a weakly supervised method of fine-tuning 3D object detectors for traffic observation cameras, showing in the process that large existing autonomous vehicle datasets can be leveraged for pre-training. To fine-tune the monocular 3D object detector, our method utilises multiple 2D detections from overlapping, wide-baseline views and a loss that encodes the underlying geometric consistency. Our method achieves vehicle 7DoF pose prediction accuracy on our dataset comparable to the top-performing monocular 3D object detectors on autonomous vehicle datasets. We present our training methodology, multi-view reprojection loss, and dataset.
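To make the idea of a multi-view reprojection loss concrete, the following is a minimal numpy sketch, not the paper's implementation: it assumes calibrated pinhole cameras and matched 2D detection centres per view, and penalises the pixel distance between a predicted 3D box centre reprojected into each view and the corresponding 2D detection. All function and variable names here are illustrative.

```python
import numpy as np

def project(K, R, t, X):
    """Project a 3D world point X into pixel coordinates for a
    camera with intrinsics K and extrinsics (R, t)."""
    x_cam = R @ X + t            # world -> camera frame
    x_img = K @ x_cam            # camera -> homogeneous pixels
    return x_img[:2] / x_img[2]  # perspective divide

def multiview_reprojection_loss(X_pred, cameras, detections):
    """Sum of squared pixel errors between the reprojected 3D box
    centre and the matched 2D detection centre in each view.

    cameras:    list of (K, R, t) tuples, one per overlapping view
    detections: list of 2D detection centres (u, v), same order
    """
    loss = 0.0
    for (K, R, t), uv_det in zip(cameras, detections):
        uv = project(K, R, t, X_pred)
        loss += float(np.sum((uv - uv_det) ** 2))
    return loss

# Two wide-baseline views observing the same point.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0,   0.0,   1.0]])
cameras = [(K, np.eye(3), np.zeros(3)),
           (K, np.eye(3), np.array([1.0, 0.0, 0.0]))]
detections = [np.array([320.0, 240.0]),
              np.array([420.0, 240.0])]

# A geometrically consistent prediction incurs zero loss;
# perturbing it increases the loss in every view it disagrees with.
consistent = multiview_reprojection_loss(np.array([0.0, 0.0, 5.0]),
                                         cameras, detections)
perturbed = multiview_reprojection_loss(np.array([0.1, 0.0, 5.0]),
                                        cameras, detections)
```

In the weakly supervised setting the abstract describes, a loss of this shape provides a training signal from 2D detections alone: no 3D labels are needed, only camera calibration and cross-view association.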