Recent advancements in the area of deep learning have shown the effectiveness of very large neural networks in several applications. However, as these deep neural networks continue to grow in size, it becomes more and more difficult to configure their many parameters to obtain good results. Presently, analysts must experiment with many different configurations and parameter settings, which is labor-intensive and time-consuming. On the other hand, the capacity of fully automated techniques for neural network architecture search is limited without the domain knowledge of human experts. To deal with the problem, we formulate the task of neural network architecture optimization as a graph space exploration, based on the one-shot architecture search technique. In this approach, a super-graph of all candidate architectures is trained in one-shot and the optimal neural network is identified as a sub-graph. In this paper, we present a framework that allows analysts to effectively build the solution sub-graph space and guide the network search by injecting their domain knowledge. Starting with the network architecture space composed of basic neural network components, analysts are empowered to effectively select the most promising components via our one-shot search scheme. Applying this technique in an iterative manner allows analysts to converge to the best performing neural network architecture for a given application. During the exploration, analysts can use their domain knowledge aided by cues provided from a scatterplot visualization of the search space to edit different components and guide the search for faster convergence. We designed our interface in collaboration with several deep learning researchers and its final effectiveness is evaluated with a user study and two case studies.
翻译:深层学习领域最近的进展表明,在多个应用领域,非常庞大的神经网络在多个应用领域都取得了成效。然而,随着这些深层神经网络的规模继续扩大,因此越来越难以配置许多参数以取得良好结果。目前,分析师必须试验许多不同的配置和参数设置,这种配置和参数设置是劳动密集型和耗时的。另一方面,完全自动化的神经网络结构搜索技术能力有限,没有人类专家的域域知识。为了解决这个问题,我们根据一发建筑搜索技术,将神经网络结构优化任务设计成一个图形空间探索任务。在这个方法中,对所有候选结构的超版进行一发式培训,将最佳神经网络确定为一个子图。在本文件中,我们提出了一个框架,使分析师能够有效地建立解决方案子绘图空间空间空间空间空间空间,指导网络搜索,通过输入其域域知识。从基本神经网络组成部分开始,分析员能够通过我们的一发图搜索方案有效地选择最有希望的构成部分。在迭代式搜索方法中应用这一技术,使用户最终的搜索模型分析师能够通过不同的搜索模型来进行最佳的搜索。