Visual perception plays an important role in autonomous driving. One of the primary tasks is object detection and identification. Since the vision sensor is rich in color and texture information, it can quickly and accurately identify various road information. The commonly used technique is based on extracting and calculating various features of the image. The recent development of deep learning-based method has better reliability and processing speed and has a greater advantage in recognizing complex elements. For depth estimation, vision sensor is also used for ranging due to their small size and low cost. Monocular camera uses image data from a single viewpoint as input to estimate object depth. In contrast, stereo vision is based on parallax and matching feature points of different views, and the application of deep learning also further improves the accuracy. In addition, Simultaneous Location and Mapping (SLAM) can establish a model of the road environment, thus helping the vehicle perceive the surrounding environment and complete the tasks. In this paper, we introduce and compare various methods of object detection and identification, then explain the development of depth estimation and compare various methods based on monocular, stereo, and RDBG sensors, next review and compare various methods of SLAM, and finally summarize the current problems and present the future development trends of vision technologies.
翻译:视觉感知在自主驾驶中起着重要作用。主要任务之一是对象探测和识别。由于视觉传感器具有丰富的颜色和纹理信息,它可以快速准确地识别各种道路信息。常用技术的基础是提取和计算图像的各种特征。最近开发的深层次学习方法具有更高的可靠性和处理速度,在识别复杂要素方面有更大的优势。深度估计,视觉感知器也因其体积小和成本低而使用,图像感知仪用于测量范围;单镜照相机从单一角度使用图像数据作为估计物体深度的输入。相反,立体视觉是以对准和匹配不同观点的特征点为基础,而深层次学习的应用也进一步提高了准确性。此外,同声定位和绘图(SLAM)可以建立一个道路环境模型,从而帮助车辆了解周围环境并完成任务。在本文中,我们介绍并比较了各种物体探测和识别方法,然后解释深度估计的发展,比较基于单镜、立体和RDBG传感器的各种方法。下一次审查并比较了SLM-M(SLAM)目前各种趋势及未来发展趋势。