In recent years, neural architecture search (NAS) has attracted intense scientific and industrial interest owing to its ability to find neural architectures with high accuracy for artificial intelligence tasks such as image classification and object detection. Gradient-based NAS approaches in particular have become popular thanks to their computational efficiency during the search. However, these methods often suffer from mode collapse, where the quality of the discovered architectures is poor because the algorithm settles on a single operation type for the entire network, or stagnates at a local minimum across various datasets and search spaces. To address these defects, we present a differentiable variational-inference-based NAS method for searching sparse convolutional neural networks. Our approach finds the optimal neural architecture by dropping out candidate operations in an over-parameterised supergraph using variational dropout with an automatic relevance determination (ARD) prior, which lets the algorithm gradually remove unnecessary operations and connections without risking mode collapse. We evaluate the method by searching two types of convolutional cells that shape the neural networks used to classify different image datasets. Our method finds diverse network cells while showing state-of-the-art accuracy with up to almost two times fewer non-zero parameters.
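To make the gating mechanism concrete, the following is a minimal, hypothetical sketch of scaling each candidate operation on a supergraph edge by a variational-dropout weight under a log-uniform (ARD-style) prior, following the sparse variational dropout formulation of Molchanov et al. (2017), which the abstract's description resembles; names such as `VDGatedEdge` and the pruning threshold are illustrative assumptions, not taken from the paper.

```python
# Hypothetical sketch: variational-dropout gates over candidate operations
# on one supergraph edge, with the KL approximation to the log-uniform
# (ARD-style) prior from Molchanov et al. (2017). Not the paper's code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VDGatedEdge(nn.Module):
    """Mixes candidate ops; each op output is scaled by a stochastic gate."""
    def __init__(self, ops):
        super().__init__()
        self.ops = nn.ModuleList(ops)
        # Variational parameters per op: gate mean theta and log-variance.
        self.theta = nn.Parameter(torch.ones(len(ops)))
        self.log_sigma2 = nn.Parameter(torch.full((len(ops),), -8.0))

    @property
    def log_alpha(self):
        # alpha = sigma^2 / theta^2 is the effective dropout rate per gate.
        return self.log_sigma2 - 2.0 * torch.log(self.theta.abs() + 1e-8)

    def forward(self, x):
        if self.training:
            # Reparameterisation trick: w = theta + sigma * eps.
            eps = torch.randn_like(self.theta)
            w = self.theta + torch.exp(0.5 * self.log_sigma2) * eps
        else:
            # At evaluation, prune ops whose dropout rate is effectively 1
            # (log_alpha above a threshold; 3.0 is a common choice).
            w = torch.where(self.log_alpha < 3.0, self.theta,
                            torch.zeros_like(self.theta))
        return sum(w[i] * op(x) for i, op in enumerate(self.ops))

    def kl(self):
        # Approximate KL(q || log-uniform prior), Molchanov et al. (2017):
        # -KL ~= k1*sigmoid(k2 + k3*log_alpha) - 0.5*log(1 + 1/alpha) - k1
        k1, k2, k3 = 0.63576, 1.87320, 1.48695
        la = self.log_alpha
        neg_kl = (k1 * torch.sigmoid(k2 + k3 * la)
                  - 0.5 * F.softplus(-la) - k1)
        return -neg_kl.sum()
```

Under this sketch, the search objective would be the task loss plus a scaled sum of `edge.kl()` terms over all edges; as training drives a gate's `log_alpha` up, the corresponding operation is effectively dropped, which is one way the gradual removal of unnecessary operations described above could be realised.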