Stereo matching is a critical task for robot navigation and autonomous vehicles, providing the depth estimation of surroundings. Among all stereo matching algorithms, Efficient Large-scale Stereo (ELAS) offers one of the best tradeoffs between efficiency and accuracy. However, due to the inherent iterative process and unpredictable memory access pattern, ELAS can only run at 1.5-3 fps on high-end CPUs and difficult to achieve real-time performance on low-power platforms. In this paper, we propose an energy-efficient architecture for real-time ELAS-based stereo matching on FPGA platform. Moreover, the original computational-intensive and irregular triangulation module is reformed in a regular manner with points interpolation, which is much more hardware-friendly. Optimizations, including memory management, parallelism, and pipelining, are further utilized to reduce memory footprint and improve throughput. Compared with Intel i7 CPU and the state-of-the-art CPU+FPGA implementation, our FPGA realization achieves up to 38.4x and 3.32x frame rate improvement, and up to 27.1x and 1.13x energy efficiency improvement, respectively.
翻译:对机器人导航和自主车辆来说,制式匹配是一项关键任务,可以提供对周围环境的深度估计。在所有立体匹配算法中,高效大型立体系统(ELAS)提供了效率与准确性的最佳权衡,然而,由于固有的迭代过程和不可预测的内存存访问模式,拉美证系统只能运行在高端CPU上1.5-3英尺,在低功率平台上很难实现实时性能。在本文中,我们提议为实时ECUS在FPGA平台上的立体匹配建立一个节能结构。此外,对原计算密集度和不规则的三角组合模块进行了定期改革,并配有更方便硬件的点间插。优化,包括记忆管理、平行和管道,被进一步用于减少记忆足迹并改进吞吐量。与Intel i7CPU和州级CPU+FPGA相比,我们的FGA实现达到38.4x和3.32x框架速率改进,以及分别达到27.1x和1.13x能效。