This paper proposes a self-supervised learned local detector and descriptor, called EventPoint, for event stream/camera tracking and registration. Event-based cameras have grown in popularity because of their biological inspiration and low power consumption. Despite this, applying local features directly to the event stream is difficult due to its peculiar data structure. We propose a new time-surface-like event stream representation method called Tencode. The event stream data processed by Tencode can obtain the pixel-level positioning of interest points while also simultaneously extracting descriptors through a neural network. Instead of using costly and unreliable manual annotation, our network leverages the prior knowledge of local feature extraction on color images and conducts self-supervised learning via homographic and spatio-temporal adaptation. To the best of our knowledge, our proposed method is the first research on event-based local features learning using a deep neural network. We provide comprehensive experiments of feature point detection and matching, and three public datasets are used for evaluation (i.e. DSEC, N-Caltech101, and HVGA ATIS Corner Dataset). The experimental findings demonstrate that our method outperforms SOTA in terms of feature point detection and description.
翻译:本文建议了一种自监督的本地已知探测器和描述器,称为“事件点”,用于事件流/摄像机跟踪和登记。事件相机由于其生物灵感和低耗力,越来越受欢迎。尽管如此,直接对事件流应用本地特征由于特殊的数据结构而困难重重。我们建议了一种新的时间表表样事件流代表法,称为“天码”。Tencode处理的事件流数据可以通过神经网络获得对感兴趣的点进行像素级定位,同时通过神经网络提取描述器。我们的网络使用昂贵和不可靠的人工注解,而不是利用以前对颜色图像的本地特征提取知识,并且通过感光学和阵列-时空适应进行自我监督学习。我们最了解的是,我们建议的方法是利用深层神经网络对基于事件的本地特征学习进行首次研究。我们提供了特征点检测和匹配的全面实验,并使用三个公共数据集进行评估(即DSEC、N-Caltech101和HVGA ATIS Corn数据设置特征特征描述系统)实验性结果和SO系统特征特征描述。