Modern mobile neural networks with a reduced number of weights and parameters do a good job with image classification tasks, but even they may be too complex to be implemented in an FPGA for video processing tasks. The article proposes neural network architecture for the practical task of recognizing images from a camera, which has several advantages in terms of speed. This is achieved by reducing the number of weights, moving from a floating-point to a fixed-point arithmetic, and due to a number of hardware-level optimizations associated with storing weights in blocks, a shift register, and an adjustable number of convolutional blocks that work in parallel. The article also proposed methods for adapting the existing data set for solving a different task. As the experiments showed, the proposed neural network copes well with real-time video processing even on the cheap FPGAs.
翻译:现代移动神经网络,其重量和参数数量减少,在图像分类任务方面效果良好,但即使这些网络可能过于复杂,无法在视频处理任务FPGA中实施。文章提议神经网络结构,以实际任务为目的,即从照相机中识别图像,这在速度方面有若干优势。通过减少重量数量,从浮动点向固定点算术转变,以及由于一些硬件级优化,与区块中储存重量、转移登记册和可调整的平行运行的卷积块有关。文章还提出了调整现有数据集以解决不同任务的方法。实验显示,拟议的神经网络即使在廉价的FPGA上也能很好地应对实时视频处理。