Object detection is considered one of the most challenging problems in this field of computer vision, as it involves the combination of object classification and object localization within a scene. Recently, deep neural networks (DNNs) have been demonstrated to achieve superior object detection performance compared to other approaches, with YOLOv2 (an improved You Only Look Once model) being one of the state-of-the-art in DNN-based object detection methods in terms of both speed and accuracy. Although YOLOv2 can achieve real-time performance on a powerful GPU, it still remains very challenging for leveraging this approach for real-time object detection in video on embedded computing devices with limited computational power and limited memory. In this paper, we propose a new framework called Fast YOLO, a fast You Only Look Once framework which accelerates YOLOv2 to be able to perform object detection in video on embedded devices in a real-time manner. First, we leverage the evolutionary deep intelligence framework to evolve the YOLOv2 network architecture and produce an optimized architecture (referred to as O-YOLOv2 here) that has 2.8X fewer parameters with just a ~2% IOU drop. To further reduce power consumption on embedded devices while maintaining performance, a motion-adaptive inference method is introduced into the proposed Fast YOLO framework to reduce the frequency of deep inference with O-YOLOv2 based on temporal motion characteristics. Experimental results show that the proposed Fast YOLO framework can reduce the number of deep inferences by an average of 38.13%, and an average speedup of ~3.3X for objection detection in video compared to the original YOLOv2, leading Fast YOLO to run an average of ~18FPS on a Nvidia Jetson TX1 embedded system.
翻译:以 DNN为基础的天体探测方法在速度和准确性方面都是最先进的问题之一。 YOLOv2 可以在强大的 GPU 上实现实时性能,但它仍然非常具有挑战性。 最近,深的神经网络(DNNS)被证明能够与其他方法相比实现优异的天体探测性能, YOLOV2 (改进的“你只看一眼”模型)是DNN的天体探测方法中速度和准确性方面最先进的之一。 虽然 YOLOv2 可以在强大的 GPU上实现实时性能, 但它仍然非常具有挑战性, 利用这一方法在具有有限计算能力和有限内存的嵌计算机设备上实时检测物体。 最近, 我们提出了一个新的框架,称为“快的 YOLOOO, 快速一模 ”, 加速YOLOv2 的视频探测方法, 我们利用进化的深深深层智能智能框架来发展YOLOV2 网络结构(这里称为OOLOVO2) 的实时实时实时图像。, 将O 的最小性动力显示一个快速性O2 系统, 的最小性观测中的拟议速度框架将降低值降低O2 的反值降低。