In recent years, neural architecture search (NAS) has received intensive scientific and industrial interest due to its ability to find neural architectures with high accuracy for various artificial intelligence tasks such as image classification or object detection. In particular, gradient-based NAS approaches have become among the most popular thanks to their computational efficiency during the search. However, these methods often experience mode collapse, where the quality of the found architectures is poor because the algorithm resorts to choosing a single operation type for the entire network, or stagnates at a local minimum on various datasets or search spaces. To address these shortcomings, we present a differentiable variational inference-based NAS method for searching sparse convolutional neural networks. Our approach finds the optimal neural architecture by dropping out candidate operations in an over-parameterised supergraph using variational dropout with an automatic relevance determination prior, which makes the algorithm gradually remove unnecessary operations and connections without risking mode collapse. The evaluation is conducted by searching two types of convolutional cells that shape the neural network for classifying different image datasets. Our method finds diverse network cells while achieving state-of-the-art accuracy with up to $3\times$ fewer parameters.
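To make the core idea concrete, below is a minimal sketch (not the authors' implementation) of how variational dropout with an automatic relevance determination prior could gate the candidate operations on a single supergraph edge. The class name `VariationalMixedOp`, the toy candidate operations, the pruning threshold, and the KL approximation constants (borrowed from Molchanov et al., 2017) are all illustrative assumptions rather than details taken from the paper.

```python
# Illustrative sketch of ARD-style variational dropout over candidate operations
# on one supergraph edge. Assumed names and constants, not the paper's code.
import torch
import torch.nn as nn


class VariationalMixedOp(nn.Module):
    """Scales each candidate op by a Gaussian multiplicative gate theta * (1 + eps),
    with eps ~ N(0, alpha); ops whose log alpha grows past a threshold are pruned."""

    def __init__(self, channels: int, prune_log_alpha: float = 3.0):
        super().__init__()
        # Toy candidate operations; a real search space would hold the full op set.
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.Conv2d(channels, channels, 5, padding=2),
            nn.AvgPool2d(3, stride=1, padding=1),
            nn.Identity(),
        ])
        k = len(self.ops)
        self.theta = nn.Parameter(torch.ones(k))                  # gate means
        self.log_sigma2 = nn.Parameter(torch.full((k,), -5.0))    # gate log-variances
        self.prune_log_alpha = prune_log_alpha

    def log_alpha(self) -> torch.Tensor:
        # alpha = sigma^2 / theta^2: a large alpha means the gate is mostly noise.
        return self.log_sigma2 - torch.log(self.theta ** 2 + 1e-8)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        outputs = torch.stack([op(x) for op in self.ops], dim=0)  # (k, N, C, H, W)
        if self.training:
            # Reparameterised sampling of one multiplicative gate per operation.
            std = torch.exp(0.5 * self.log_sigma2)
            gates = self.theta + std * torch.randn_like(std)
        else:
            # At evaluation time, hard-prune operations that ARD drove to pure noise.
            keep = (self.log_alpha() < self.prune_log_alpha).float()
            gates = self.theta * keep
        return (gates.view(-1, 1, 1, 1, 1) * outputs).sum(dim=0)

    def kl(self) -> torch.Tensor:
        # Approximate KL(q || ARD prior) from Molchanov et al. (2017); minimising it
        # pushes unneeded gates towards high alpha, i.e. removes their operations.
        k1, k2, k3 = 0.63576, 1.87320, 1.48695
        la = self.log_alpha()
        neg_kl = (k1 * torch.sigmoid(k2 + k3 * la)
                  - 0.5 * nn.functional.softplus(-la) - k1)
        return -neg_kl.sum()


if __name__ == "__main__":
    op = VariationalMixedOp(channels=8)
    x = torch.randn(2, 8, 16, 16)
    loss = op(x).mean() + 1e-3 * op.kl()  # task loss plus weighted sparsity term
    loss.backward()
```

In this reading, the KL term plays the role of the sparsity pressure described in the abstract: operations whose gates the data does not support drift to high noise levels and are dropped, so the edge gradually retains only the relevant operations instead of collapsing onto a single operation type.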