The tremendous potential exhibited by deep learning is often offset by architectural and computational complexity, making widespread deployment a challenge for edge scenarios such as mobile and other consumer devices. To tackle this challenge, we explore the following idea: Can we learn generative machines to automatically generate deep neural networks with efficient network architectures? In this study, we introduce the idea of generative synthesis, which is premised on the intricate interplay between a generator-inquisitor pair that work in tandem to garner insights and learn to generate highly efficient deep neural networks that best satisfies operational requirements. What is most interesting is that, once a generator has been learned through generative synthesis, it can be used to generate not just one but a large variety of different, unique highly efficient deep neural networks that satisfy operational requirements. Experimental results for image classification, semantic segmentation, and object detection tasks illustrate the efficacy of generative synthesis in producing generators that automatically generate highly efficient deep neural networks (which we nickname FermiNets) with higher model efficiency and lower computational costs (reaching >10x more efficient and fewer multiply-accumulate operations than several tested state-of-the-art networks), as well as higher energy efficiency (reaching >4x improvements in image inferences per joule consumed on a Nvidia Tegra X2 mobile processor). As such, generative synthesis can be a powerful, generalized approach for accelerating and improving the building of deep neural networks for on-device edge scenarios.
翻译:深层学习所展示的巨大潜力往往被建筑和计算的复杂性所抵消,使广泛部署成为移动和其他消费设备等边缘情景的挑战。为了应对这一挑战,我们探讨以下想法:我们能否学习基因化机器,以自动生成高效网络结构的深神经网络?在本研究中,我们引入基因合成理念,其前提是发电机-询问者对配对之间错综复杂的相互作用,这些对配对协同工作,以获得洞察力和学习产生效率高、最能满足业务要求的高效深神经网络。最有趣的是,一旦通过基因合成学习了发电机,它不仅可以用来产生一种而且可以产生大量不同的、独特的高效的深神经网络,满足操作要求。图像分类、语义分解和物体检测任务的实验结果表明,在生成能自动产生效率高的深神经网络(我们称之为FermiNet),其模型效率更高,计算成本更低(达到>10和倍增倍增运行运行速度比几个经过测试的精细的神经网络更精细的精细神经网络),可以用来在不断加速的图像系统中提高速度。