Advanced video analytic systems, including scene classification and object detection, have seen widespread success in various domains such as smart cities and autonomous transportation. With an ever-growing number of powerful client devices, there is incentive to move these heavy video analytics workloads from the cloud to mobile devices to achieve low latency and real-time processing and to preserve user privacy. However, most video analytic systems are heavyweight and are trained offline with some pre-defined latency or accuracy requirements. This makes them unable to adapt at runtime in the face of three types of dynamism -- the input video characteristics change, the amount of compute resources available on the node changes due to co-located applications, and the user's latency-accuracy requirements change. In this paper we introduce ApproxDet, an adaptive video object detection framework for mobile devices to meet accuracy-latency requirements in the face of changing content and resource contention scenarios. To achieve this, we introduce a multi-branch object detection kernel (layered on Faster R-CNN), which incorporates a data-driven modeling approach on the performance metrics, and a latency SLA-driven scheduler to pick the best execution branch at runtime. We couple this kernel with approximable video object tracking algorithms to create an end-to-end video object detection system. We evaluate ApproxDet on a large benchmark video dataset and compare quantitatively to AdaScale and YOLOv3. We find that ApproxDet is able to adapt to a wide variety of contention and content characteristics and outshines all baselines, e.g., it achieves 52% lower latency and 11.1% higher accuracy over YOLOv3.
翻译:高级视频分析系统,包括现场分类和物体探测,在智能城市和自主运输等不同领域都取得了广泛成功。随着强大的客户设备数量不断增加,鼓励将这些沉重的视频分析工作量从云层移到移动设备,以实现低潜值和实时处理,并保护用户隐私。然而,大多数视频分析系统都是重量级的,经过一些预先界定的衬里或准确性要求的培训,因此无法在运行时适应三种类型的动态 -- -- 输入视频特性的变化,在节点变化上可以计算的资源数量,以及用户的惯性精确度要求变化。在本文件中,我们引入了适应性视频目标检测框架,以满足在内容和资源争议情景发生变化时的准确性要求。为了实现这一点,我们引入了多列式对象检测(在快速 R-NCN 上设置的),在共置应用应用程序的节点变化中,可计算出可用在节点上的节点变化的精确性资源。我们引入了以数据驱动的透明性O值操作系统,在高清晰度测试中,我们引入了高清晰度的视频定位系统,我们通过直径定位系统,并实现了对等的图像定位,我们可以对等的轨对等的跟踪系统进行。