Event cameras that asynchronously output low-latency event streams provide great opportunities for state estimation under challenging situations. Despite event-based visual odometry having been extensively studied in recent years, most of them are based on monocular and few research on stereo event vision. In this paper, we present ESVIO, the first event-based stereo visual-inertial odometry, which leverages the complementary advantages of event streams, standard images and inertial measurements. Our proposed pipeline achieves temporal tracking and instantaneous matching between consecutive stereo event streams, thereby obtaining robust state estimation. In addition, the motion compensation method is designed to emphasize the edge of scenes by warping each event to reference moments with IMU and ESVIO back-end. We validate that both ESIO (purely event-based) and ESVIO (event with image-aided) have superior performance compared with other image-based and event-based baseline methods on public and self-collected datasets. Furthermore, we use our pipeline to perform onboard quadrotor flights under low-light environments. A real-world large-scale experiment is also conducted to demonstrate long-term effectiveness. We highlight that this work is a real-time, accurate system that is aimed at robust state estimation under challenging environments.
翻译:异步输出低延迟事件流的事件相机为在挑战性环境下的状态估计提供了巨大的可能性。尽管近年来已经广泛研究了基于事件的视觉里程计,但其中大部分都是基于单目的,很少有针对立体事件视觉的研究。在本文中,我们提出了 ESVIO,它是第一个基于事件的立体视觉惯性里程计,利用事件流、标准图像和惯性测量的互补优势。我们的提出的流程实现了连续立体事件流之间的时间跟踪和瞬时匹配,从而获得了强健的状态估计。此外,运动补偿方法旨在通过 IMU 和 ESVIO 后端将每个事件变形为参考时刻的场景边缘。我们验证了纯事件和图像辅助的 ESVIO 与公共和自收集数据集上的其他基线方法相比具有卓越的性能。此外,我们使用我们的流程在低光环境下进行机载四旋翼飞行。还进行了一个现实世界的大规模实验以证明其长期有效性。我们强调,此工作是针对在具有挑战性的环境下进行强健状态估计的实时、准确系统。