Deep learning technologies have demonstrated remarkable effectiveness in a wide range of tasks, and deep learning holds the potential to advance a multitude of applications, including in edge computing, where deep models are deployed on edge devices to enable instant data processing and response. A key challenge is that while the application of deep models often incurs substantial memory and computational costs, edge devices typically offer only very limited storage and computational capabilities that may vary substantially across devices. These characteristics make it difficult to build deep learning solutions that unleash the potential of edge devices while complying with their constraints. A promising approach to addressing this challenge is to automate the design of effective deep learning models that are lightweight, require only a little storage, and incur only low computational overheads. This survey offers comprehensive coverage of studies of design automation techniques for deep learning models targeting edge computing. It offers an overview and comparison of key metrics that are used commonly to quantify the proficiency of models in terms of effectiveness, lightness, and computational costs. The survey then proceeds to cover three categories of the state-of-the-art of deep model design automation techniques: automated neural architecture search, automated model compression, and joint automated design and compression. Finally, the survey covers open issues and directions for future research.
翻译:深层学习技术在一系列广泛任务中表现出了显著的效益,深层学习技术具有推动多种应用的潜力,包括边缘计算,在边缘设备上部署深层模型,以便能够进行即时数据处理和反应。一个关键的挑战在于,深层模型的应用往往产生大量的内存和计算成本,但边缘设备通常只能提供非常有限的储存和计算能力,而这种能力在各种设备之间差别很大。这些特点使得难以建立深层学习解决方案,从而释放边缘装置的潜力,同时又符合其限制条件。应对这一挑战的一个有希望的方法是将有效的深层学习模型的设计自动化,这些模型轻度,只需要少量储存,并且仅产生低量的计算间接费用。这一调查全面覆盖了针对边缘计算机的深层学习模型设计自动化技术设计研究的范围。它提供了对通常用于量化模型在有效性、轻度和计算成本方面的熟练程度的主要指标的概述和比较。然后,调查将涵盖深层模型设计自动化结构搜索、自动模型压缩、联合自动自动设计和压缩的未来研究方向。