3D object detection is vital because it captures objects' sizes, orientations, and positions in the world. This makes it usable in real-world applications such as Augmented Reality (AR), self-driving cars, and robotics, which perceive the world the same way we do as humans. Monocular 3D object detection is the task of drawing 3D bounding boxes around objects in a single 2D RGB image. It is a localization task, but one performed without any extra information such as depth, other sensors, or multiple images. It is an important yet challenging task. Beyond the significant progress in image-based 2D object detection, 3D understanding of real-world objects remains an open challenge that has not been explored extensively thus far. In addition to the most closely related studies.