The design of compact deep neural networks is a crucial task to enable widespread adoption of deep neural networks in the real-world, particularly for edge and mobile scenarios. Due to the time-consuming and challenging nature of manually designing compact deep neural networks, there has been significant recent research interest into algorithms that automatically search for compact network architectures. A particularly interesting class of compact architecture search algorithms are those that are guided by baseline network architectures. Such algorithms have been shown to be significantly more computationally efficient than unguided methods. In this study, we explore the current state of compact architecture search for deep neural networks through both theoretical and empirical analysis of four different state-of-the-art compact architecture search algorithms: i) group lasso regularization, ii) variational dropout, iii) MorphNet, and iv) Generative Synthesis. We examine these methods in detail based on a number of different factors such as efficiency, effectiveness, and scalability. Furthermore, empirical evaluations are conducted to compare the efficacy of these compact architecture search algorithms across three well-known benchmark datasets. While by no means an exhaustive exploration, we hope that this study helps provide insights into the interesting state of this relatively new area of research in terms of diversity and real, tangible gains already achieved in architecture design improvements. Furthermore, the hope is that this study would help in pushing the conversation forward towards a deeper theoretical and empirical understanding where the research community currently stands in the landscape of compact architecture search for deep neural networks, and the practical challenges and considerations in leveraging such approaches for operational usage.
翻译:设计精密精密神经网络是一项至关重要的任务,有助于在现实世界中广泛采用深层神经网络,特别是边缘和移动情景。由于手工设计精密神经网络的过程耗时且具有挑战性,最近对自动搜索精密网络结构的算法有相当的研究兴趣。特别有趣的一类紧凑结构搜索算法是由基线网络结构指导的。这种算法在计算上比非指导方法效率要高得多。在本研究中,我们探索了目前通过对四种不同状态的精密结构搜索算法进行深度神经网络的理论和经验分析的状态:一) 光纤结构正规化,二) 变异性退出,三) 摩菲网络和四) 创新合成。我们根据效率、有效性和可缩放性等不同因素对这些方法进行详细研究。此外,进行实证评估是为了比较这些精细细结构在三个广为人知的基准数据集中搜索算法的功效。尽管目前没有对四种最先进的结构结构搜索方法进行彻底的分析分析,但我们希望在这种研究中能够提供这种实际的深层次的深层次的深层次的深层次研究领域,从而了解这一结构的深层次的研究成果。