TTIA:利用早期推断和精细查询规划加快视频分析 (THIA: Accelerating Video Analytics using Early Inference and Fine-Grained Query Planning)

To efficiently process visual data at scale, researchers have proposed two techniques for lowering the computational overhead associated with the underlying deep learning models. The first approach consists of leveraging a specialized, lightweight model to directly answer the query. The second approach focuses on filtering irrelevant frames using a lightweight model and processing the filtered frames using a heavyweight model. These techniques suffer from two limitations. With the first approach, the specialized model is unable to provide accurate results for hard-to-detect events. With the second approach, the system is unable to accelerate queries focusing on frequently occurring events as the filter is unable to eliminate a significant fraction of frames in the video. In this paper, we present THIA, a video analytics system for tackling these limitations. The design of THIA is centered around three techniques. First, instead of using a cascade of models, it uses a single object detection model with multiple exit points for short-circuiting the inference. This early inference technique allows it to support a range of throughput-accuracy tradeoffs. Second, it adopts a fine-grained approach to planning and processes different chunks of the video using different exit points to meet the user's requirements. Lastly, it uses a lightweight technique for directly estimating the exit point for a chunk to lower the optimization time. We empirically show that these techniques enable THIA to outperform two state-of-the-art video analytics systems by up to 6.5X, while providing accurate results even on queries focusing on hard-to-detect events.

翻译：为了高效处理规模的视觉数据,研究人员提出了两种降低与深层学习模型相关的计算间接费用的技术。第一种方法是利用一个专门、轻量模型直接回答询问。第二种方法是利用轻量模型过滤不相关框架,用重量模型处理过滤框架。这些技术有两种局限性。第一种方法是,专门模型无法为难以探测的事件提供准确的结果。第二种方法是,由于过滤器无法消除大量视频框架,因此系统无法加快对经常发生的事件的查询。在本文件中,我们介绍一个视频分析系统,即处理这些限制的视频分析系统。THIA的设计以三种技术为中心。首先,它使用一个具有多个导出点的单一物体探测模型来缩短推断。这种早期推导出技术使得它无法支持一系列经常发生的事件,因为过滤器无法消除大量视频框架中的相当部分。在本文中,我们介绍一个视频分析系统(THTIA),一个视频分析系统(THIA),一个视频分析系统(THIA)的视频分析系统,然后用不同的图像分析模型来直接估算不同退出的输出。首先,我们用这些图像分析系统来提供一个直径的图像分析系统,然后用这些图像分析系统,然后用不同的图像分析系统来直接分析结果,然后用不同的图像分析系统来显示一个直向不同的输出。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【ACL2020】DeeBERT:动态加速BERT推理，DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference

专知会员服务

21+阅读 · 2020年4月30日

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日

在线变分推断，76页ppt，A Regret Bound for Online Variational Inference

专知会员服务

21+阅读 · 2019年12月2日