This paper presents novel reconfigurable architectures for reducing the latency of recurrent neural networks (RNNs) that are used for detecting gravitational waves. Gravitational interferometers such as the LIGO detectors capture cosmic events such as black hole mergers which happen at unknown times and of varying durations, producing time-series data. We have developed a new architecture capable of accelerating RNN inference for analyzing time-series data from LIGO detectors. This architecture is based on optimizing the initiation intervals (II) in a multi-layer LSTM (Long Short-Term Memory) network, by identifying appropriate reuse factors for each layer. A customizable template for this architecture has been designed, which enables the generation of low-latency FPGA designs with efficient resource utilization using high-level synthesis tools. The proposed approach has been evaluated based on two LSTM models, targeting a ZYNQ 7045 FPGA and a U250 FPGA. Experimental results show that with balanced II, the number of DSPs can be reduced up to 42% while achieving the same IIs. When compared to other FPGA-based LSTM designs, our design can achieve about 4.92 to 12.4 times lower latency.
翻译:本文介绍了用于探测引力波的经常性神经网络(RNNS)潜伏性的新结构。LIGO探测器等引力干涉仪收集宇宙事件,如在不为人知的时间和不同时间段发生的黑洞合并,生成时间序列数据。我们开发了一个新的结构,能够加速RNN推断分析LIGO探测器的时间序列数据。这一结构的基础是优化多层LSTM(短期内存)网络的启动间隔(II),为每一层确定适当的再利用因素。已经设计了一个可定制的这一结构模板,能够生成低纬度的FPGA设计,利用高层次合成工具高效利用资源。已经根据两个LSTM模型,针对ZYNQ 7045 FPGA和U250 FPGA。实验结果显示,在平衡二号网络中,DSPTM的数量可以减少到42 %,同时实现相同的II。与我们基于LAS的12度设计相比,可以达到12度的LA-S。