Prior works on event-based optical flow estimation have investigated several gradient-based learning methods to train neural networks for predicting optical flow. However, they do not utilize the fast data rate of event data streams and rely on a spatio-temporal representation constructed from a collection of events over a fixed period of time (often between two grayscale frames). As a result, optical flow is only evaluated at a frequency much lower than the rate data is produced by an event-based camera, leading to a temporally sparse optical flow estimation. To predict temporally dense optical flow, we cast the problem as a sequential learning task and propose a training methodology to train sequential networks for continuous prediction on an event stream. We propose two types of networks: one focused on performance and another focused on compute efficiency. We first train long-short term memory networks (LSTMs) on the DSEC dataset and demonstrated 10x temporally dense optical flow estimation over existing flow estimation approaches. The additional benefit of having a memory to draw long temporal correlations back in time results in a 19.7% improvement in flow prediction accuracy of LSTMs over similar networks with no memory elements. We subsequently show that the inherent recurrence of spiking neural networks (SNNs) enables them to learn and estimate temporally dense optical flow with 31.8% lesser parameters than LSTM, but with a slightly increased error. This demonstrates potential for energy-efficient implementation of fast optical flow prediction using SNNs.
翻译:以往关于事件光学流估算的工程调查了若干基于梯度的学习方法,以培训神经网络,预测光学流;然而,它们并没有使用事件数据流的快速数据速率,而依赖于在固定时间段内(通常介于两个灰度框架之间)收集事件而形成的时空代表。因此,光流仅以比以事件为基础的相机产生的速率数据低得多的频率进行评估,从而导致对光流进行时间性分散的光流估计。为了预测时间密集的光流,我们将此问题作为一个连续学习任务,并提出培训方法,以培训连续网络,对事件流进行连续预测。我们提出了两种类型的网络:一种侧重于性能,另一种侧重于计算效率。我们首先在DSEC数据集上培训长期短期内存储网络(LSTMs),并演示对现有流量估计方法的10x时间性密集光流估计。记忆在时间上回溯溯延时间,其结果是19.7%的LSTMMS系统在类似网络上预测流量的准确性预测准确度,而没有记忆流值增加。我们随后用快速的流值网络来显示其内在的反复性循环,而感光性循环的能量值为31的频率学习,我们随后显示它们学习的频率的频率流学流学流学的频率的频率的频率为较低。