Front-end electronics equipped with high-speed digitizers are being used and proposed for future nuclear detectors. Recent literature reveals that deep learning models, especially one-dimensional convolutional neural networks, are promising when dealing with digital signals from nuclear detectors. Simulations and experiments demonstrate the satisfactory accuracy and additional benefits of neural networks in this area. However, specific hardware accelerating such models for online operations still needs to be studied. In this work, we introduce PulseDL-II, a system-on-chip (SoC) specially designed for applications of event feature (time, energy, etc.) extraction from pulses with deep learning. Based on the previous version, PulseDL-II incorporates a RISC CPU into the system structure for better functional flexibility and integrity. The neural network accelerator in the SoC adopts a three-level (arithmetic unit, processing element, neural network) hierarchical architecture and facilitates parameter optimization of the digital design. Furthermore, we devise a quantization scheme and associated implementation methods (rescale & bit-shift) for full compatibility with deep learning frameworks (e.g., TensorFlow) within a selected subset of layer types. With the current scheme, the quantization-aware training of neural networks is supported, and network models are automatically transformed into software of RISC CPU by dedicated scripts, with nearly no loss of accuracy. We validate PulseDL-II on field programmable gate arrays (FPGA). Finally, system validation is done with an experimental setup made up of a direct digital synthesis (DDS) signal generator and an FPGA development board with analog-to-digital converters (ADC). The proposed system achieved 60 ps time resolution and 0.40% energy resolution with online neural network inference at signal to noise ratio (SNR) of 47.4 dB.
翻译:最新文献显示,在处理来自核探测器的数字信号时,深层学习模型,特别是一维进化神经网络,很有希望。模拟和实验表明神经网络在这一领域的准确性和额外效益令人满意。然而,仍然需要研究为在线操作加速这种模型的具体硬件。在这项工作中,我们引入了PulseDL-II,一个专门设计用于应用事件特征(时间、能源等)的系统-芯片(SoC),从深度学习的脉冲中提取。基于以前的版本,PulseDL-II将RISC CPU纳入系统结构,以提高机能灵活性和完整性。 SoC的神经网络加速器采用了三级(电磁装置、处理元件、神经网络)级结构,便于数字设计的参数优化。此外,我们设计了一个有时间特征(时间、能量等)的质变数(缩放和位)执行方法(Sluseralalal-liver),用于与高级智能智能网络的完全兼容(Scial-ral-real-real-deal-deal-deal-de-deal-deal-deal-deal-de-de-de-de-de-ligal-de-de-de-de-de-deal-de-de-deal-listal-deal-deal-de-de-deal-deal-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-mod-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-to-i-i-to-i-i-i-i-i-i-i-