Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios. Existing feature tracking methods for event cameras are either handcrafted or derived from first principles but require extensive parameter tuning, are sensitive to noise, and do not generalize to different scenarios due to unmodeled effects. To tackle these deficiencies, we introduce the first data-driven feature tracker for event cameras, which leverages low-latency events to track features detected in a grayscale frame. We achieve robust performance via a novel frame attention module, which shares information across feature tracks. By directly transferring zero-shot from synthetic to real data, our data-driven tracker outperforms existing approaches in relative feature age by up to 120% while also achieving the lowest latency. This performance gap is further increased to 130% by adapting our tracker to real data with a novel self-supervision strategy.
翻译:由于其高时间分辨率、增强的抗运动模糊能力和非常稀疏的输出,事件相机已经被证明非常适合于低延迟和低带宽的特征跟踪,甚至在具有挑战性的场景下也能表现良好。现有的事件相机特征跟踪方法要么是手工制作的,要么是从第一原理导出的,但需要进行广泛的参数调整,对噪声敏感,并且由于未建模的效应,无法推广到不同的情况。为了解决这些不足,我们介绍了第一个事件相机数据驱动的特征跟踪器,利用低延迟事件跟踪灰度帧中检测到的特征。我们通过一种新颖的帧注意模块来实现鲁棒性,该模块在特征跟踪之间共享信息。通过将零样本直接从合成数据转移到真实数据,我们的数据驱动跟踪器相对特征年龄的性能比现有方法提高了高达120%,同时实现了最低延迟。通过一种新颖的自监督策略将我们的跟踪器适应于真实数据后,此性能差距进一步增加到130%。