3D object detection has a pivotal role in a wide range of applications, most notably autonomous driving and robotics. These applications are commonly deployed on edge devices to promptly interact with the environment, and often require near real-time response. With limited computation power, it is challenging to execute 3D detection on the edge using highly complex neural networks. Common approaches such as offloading to the cloud brings latency overheads due to the large amount of 3D point cloud data during transmission. To resolve the tension between wimpy edge devices and compute-intensive inference workloads, we explore the possibility of transforming fast 2D detection results to extrapolate 3D bounding boxes. To this end, we present Moby, a novel system that demonstrates the feasibility and potential of our approach. Our main contributions are two-fold: First, we design a 2D-to-3D transformation pipeline that takes as input the point cloud data from LiDAR and 2D bounding boxes from camera that are captured at exactly the same time, and generate 3D bounding boxes efficiently and accurately based on detection results of the previous frames without running 3D detectors. Second, we design a frame offloading scheduler that dynamically launches a 3D detection when the error of 2D-to-3D transformation accumulates to a certain level, so the subsequent transformations can draw upon the latest 3D detection results with better accuracy. Extensive evaluation on NVIDIA Jetson TX2 with the autonomous driving dataset KITTI and real-world 4G/LTE traces shows that, Moby reduces the end-to-end latency by up to 91.9% with mild accuracy drop compared to baselines. Further, Moby shows excellent energy efficiency by saving power consumption and memory footprint up to 75.7% and 48.1%, respectively.
翻译:3D 对象检测在广泛的应用中具有关键作用, 最明显的是自主驱动和机器人。 这些应用通常在边缘设备上部署, 以便与环境迅速互动, 并经常需要近实时反应。 由于计算能力有限, 使用高度复杂的神经网络在边缘执行三维检测具有挑战性。 通常的方法, 如向云端卸载, 将3D点云数据从传输过程中的大量 3D点云数据输入延缓性间接费用。 为了解决微弱边缘装置和计算密集48度推断工作量之间的紧张关系, 我们探索将快速二D检测结果转换为外推 3D 捆绑框的可能性。 至此端, 我们展示了莫比, 一个展示我们方法可行性和潜力的新系统。 我们的主要贡献是两重 : 首先, 我们设计一个 2D 到 3DD 转换管道, 将点云数据输入在同一时间分别捕获的相机的点云数据, 并生成 3TED 缩放框, 以高效和准确性存储前一框架的检测结果来降低 3D 。 第二, 我们设计一个显示一个更精确的驱动变动的驱动到2G 数据, 向一个更精度 的 。 将一个更精度 的 向一个更精确的变动的显示 。 在 3D 3D