自动4D: 从序列点云中学习标签 4D 对象 (Auto4D: Learning to Label 4D Objects from Sequential Point Clouds)

In the past few years we have seen great advances in 3D object detection thanks to deep learning methods. However, they typically rely on large amounts of high-quality labels to achieve good performance, which often require time-consuming and expensive work by human annotators. To address this we propose an automatic annotation pipeline that generates accurate object trajectories in 3D (ie, 4D labels) from LiDAR point clouds. Different from previous works that consider single frames at a time, our approach directly operates on sequential point clouds to combine richer object observations. The key idea is to decompose the 4D label into two parts: the 3D size of the object, and its motion path describing the evolution of the object's pose through time. More specifically, given a noisy but easy-to-get object track as initialization, our model first estimates the object size from temporally aggregated observations, and then refines its motion path by considering both frame-wise observations as well as temporal motion cues. We validate the proposed method on a large-scale driving dataset and show that our approach achieves significant improvements over the baselines. We also showcase the benefits of our approach under the annotator-in-the-loop setting.

翻译：在过去几年里,由于深层次的学习方法,我们在3D天体探测方面取得了巨大进步。然而,它们通常依赖大量高质量的标签来取得良好的性能,这往往需要人类笔记员花费大量时间和花费大量的工作。为了解决这个问题,我们提议了自动注解管道,从LIDAR点云中生成3D(ie, 4D 标签)的精确天体轨迹。与以往每次考虑单一框架的工程不同,我们的方法直接在连续点云上运行,以结合较丰富的天体观测。关键的想法是将4D标记分解成两个部分:物体的3D大小及其描述物体姿势演变的动向路径。更具体地说,由于初始化是一个吵闹但容易找到的物体轨迹,我们的模型首先从时间汇总的观测中估算物体大小,然后通过考虑框架性观测和时间运动提示来改进其运动路径。我们验证了在大规模驱动数据集上的拟议方法,并表明我们的方法在基线上取得了显著的改进。我们还演示了我们的方法的好处。

相关内容

点云

关注 48

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

MIT-深度学习Deep Learning State of the Art in 2020，87页ppt

专知会员服务

62+阅读 · 2020年2月17日