自相关系数逆变异系数抑制下的零样本神经架构搜索 (ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients) - 专知论文

会员服务 ·

0

零样本 · 样本 · 搜索 · 逆变 · 相关系数 ·

2023 年 4 月 12 日

ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients

翻译：自相关系数逆变异系数抑制下的零样本神经架构搜索

Guihong Li,Yuedong Yang,Kartikeya Bhardwaj,Radu Marculescu

from arxiv, ICLR 2023 Spotlight

Neural Architecture Search (NAS) is widely used to automatically obtain the neural network with the best performance among a large number of candidate architectures. To reduce the search time, zero-shot NAS aims at designing training-free proxies that can predict the test performance of a given architecture. However, as shown recently, none of the zero-shot proxies proposed to date can actually work consistently better than a naive proxy, namely, the number of network parameters (#Params). To improve this state of affairs, as the main theoretical contribution, we first reveal how some specific gradient properties across different samples impact the convergence rate and generalization capacity of neural networks. Based on this theoretical analysis, we propose a new zero-shot proxy, ZiCo, the first proxy that works consistently better than #Params. We demonstrate that ZiCo works better than State-Of-The-Art (SOTA) proxies on several popular NAS-Benchmarks (NASBench101, NATSBench-SSS/TSS, TransNASBench-101) for multiple applications (e.g., image classification/reconstruction and pixel-level prediction). Finally, we demonstrate that the optimal architectures found via ZiCo are as competitive as the ones found by one-shot and multi-shot NAS methods, but with much less search time. For example, ZiCo-based NAS can find optimal architectures with 78.1%, 79.4%, and 80.4% test accuracy under inference budgets of 450M, 600M, and 1000M FLOPs, respectively, on ImageNet within 0.4 GPU days. Our code is available at https://github.com/SLDGroup/ZiCo.

翻译：神经结构搜索（NAS）被广泛用于自动获取最佳性能的神经网络，其中由大量候选结构中选择。为了减少搜索时间，零样本NAS旨在设计无需训练即可预测给定结构测试性能的代理。然而，最近的研究表明，迄今为止提出的零样本代理实际上都不能一致地比某个简单代理（即网络参数数量#Params）表现更好。为了改善这种情况，本文首先揭示了梯度在不同样本之间的某些特定性质如何影响神经网络的收敛速度和泛化能力，进而提出了一种新的零样本代理：ZiCo。这是第一种始终优于#Params的代理。我们证明了ZiCo在多个应用场景（如图像分类/重建、像素级预测）上对几个常见NAS-Benchmark（NASBench101，NATSBench-SSS/TSS，TransNASBench-101）的预测效果都优于现有技术（State-Of-The-Art，SOTA）代理。最后，我们证明了ZiCo找到的最优架构与一次/多次搜索得到的最佳性能相当，但搜索时间要少得多。例如，在0.4 GPU天的时间内，基于ZiCo的NAS可以在推理预算为450M、600M和1000M FLOPs的条件下，在ImageNet上实现78.1％、79.4％和80.4％的测试精度。我们的代码可在https://github.com/SLDGroup/ZiCo 上获得。

0

相关内容

零样本

【ICLR 2023】Zico:基于梯度变异逆系数的零样本NAS

【ICLR 2023】Zico:基于梯度变异逆系数的零样本NAS

专知会员服务

7+阅读 · 2023年1月29日

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

专知会员服务

15+阅读 · 2022年3月12日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】非凸从动件的基于梯度的双层优化

专知会员服务

13+阅读 · 2021年10月12日

【ICML2021】 One-shot 权重共享神经网络结构搜索算法

专知会员服务

18+阅读 · 2021年8月4日

【CVPR2021】用随机标签的神经架构搜索

专知会员服务

12+阅读 · 2021年3月21日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【ACL2020】TriggerNER:使用实体触发器学习作为解释用于命名实体识别

【ACL2020】TriggerNER:使用实体触发器学习作为解释用于命名实体识别

专知会员服务

23+阅读 · 2020年4月18日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

专知会员服务

24+阅读 · 2019年11月4日

ICML 2022 | 阿里达摩院灵瞳实验室：基于最大熵原理的目标检测搜索

ICML 2022 | 阿里达摩院灵瞳实验室：基于最大熵原理的目标检测搜索

PaperWeekly

1+阅读 · 2022年8月19日

【NeurIPS 2020】核基渐进蒸馏加法器神经网络

【NeurIPS 2020】核基渐进蒸馏加法器神经网络

专知

13+阅读 · 2020年10月19日

【NeurIPS 2019】7篇自动化神经网络搜索(NAS)论文简读

【NeurIPS 2019】7篇自动化神经网络搜索(NAS)论文简读

专知

31+阅读 · 2019年9月12日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

数据集|更大的行人重识别测试集 Market-1501+500k

数据集|更大的行人重识别测试集 Market-1501+500k

极市平台

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇图像描述生成相关论文—字符级推断、视觉解释、语义对齐、实体感知、确定性非自回归

【论文推荐】最新六篇图像描述生成相关论文—字符级推断、视觉解释、语义对齐、实体感知、确定性非自回归

专知

15+阅读 · 2018年5月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

新型小分子64B靶向抑制眼脉络膜黑色素瘤肝转移的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

PI3K催化亚单位调控AKT活化影响非小细胞肺癌脑转移的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

液滴热毛细迁移的准定态假设适用性与稳定性研究

国家自然科学基金

0+阅读 · 2014年12月31日

HOXB-AS3/HOXB7/PAK4信号轴调控结直肠癌侵袭转移的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

组蛋白乙酰基转移酶PCAF通过乙酰化CDK4抑制胃癌增殖的研究

国家自然科学基金

0+阅读 · 2013年12月31日

选择性杀伤肺癌细胞的miRNA的筛选和功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

功率变换器非线性不稳定行为的washout滤波器控制方法

国家自然科学基金

0+阅读 · 2012年12月31日

拟Frobenius-Lusztig核

国家自然科学基金

0+阅读 · 2012年12月31日

肿瘤细胞中凋亡抑制蛋白CFLAR乙酰化调控的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

超过程及相关SPDE的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Efficient PDE-Constrained optimization under high-dimensional uncertainty using derivative-informed neural operators

Arxiv

0+阅读 · 2023年5月31日

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

Arxiv

0+阅读 · 2023年5月31日

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking

Arxiv

0+阅读 · 2023年5月31日

Efficient Training of Energy-Based Models Using Jarzynski Equality

Arxiv

0+阅读 · 2023年5月30日

A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation

Arxiv

0+阅读 · 2023年5月30日

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking

Arxiv

0+阅读 · 2023年5月30日

Coherent Soft Imitation Learning

Arxiv

0+阅读 · 2023年5月29日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

28+阅读 · 2021年6月16日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

VIP会员

文章信息

相关主题

相关VIP内容

【ICLR 2023】Zico:基于梯度变异逆系数的零样本NAS

【ICLR 2023】Zico:基于梯度变异逆系数的零样本NAS

专知会员服务

7+阅读 · 2023年1月29日

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

专知会员服务

15+阅读 · 2022年3月12日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】非凸从动件的基于梯度的双层优化

专知会员服务

13+阅读 · 2021年10月12日

【ICML2021】 One-shot 权重共享神经网络结构搜索算法

专知会员服务

18+阅读 · 2021年8月4日

【CVPR2021】用随机标签的神经架构搜索

专知会员服务

12+阅读 · 2021年3月21日

【Google】平滑对抗训练，Smooth Adversarial Training

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【ACL2020】TriggerNER:使用实体触发器学习作为解释用于命名实体识别

【ACL2020】TriggerNER:使用实体触发器学习作为解释用于命名实体识别

专知会员服务

23+阅读 · 2020年4月18日

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

【论文|Google】基于元学习的排序架构，Ranking architectures using meta-learning

专知会员服务

18+阅读 · 2019年11月30日

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

【Google论文】ALBERT:自我监督学习语言表达的精简BERT

专知会员服务

24+阅读 · 2019年11月4日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

ICML 2022 | 阿里达摩院灵瞳实验室：基于最大熵原理的目标检测搜索

ICML 2022 | 阿里达摩院灵瞳实验室：基于最大熵原理的目标检测搜索

PaperWeekly

1+阅读 · 2022年8月19日

【NeurIPS 2020】核基渐进蒸馏加法器神经网络

【NeurIPS 2020】核基渐进蒸馏加法器神经网络

专知

13+阅读 · 2020年10月19日

【NeurIPS 2019】7篇自动化神经网络搜索(NAS)论文简读

【NeurIPS 2019】7篇自动化神经网络搜索(NAS)论文简读

专知

31+阅读 · 2019年9月12日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

数据集|更大的行人重识别测试集 Market-1501+500k

数据集|更大的行人重识别测试集 Market-1501+500k

极市平台

26+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新六篇图像描述生成相关论文—字符级推断、视觉解释、语义对齐、实体感知、确定性非自回归

【论文推荐】最新六篇图像描述生成相关论文—字符级推断、视觉解释、语义对齐、实体感知、确定性非自回归

专知

15+阅读 · 2018年5月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Efficient PDE-Constrained optimization under high-dimensional uncertainty using derivative-informed neural operators

Arxiv

0+阅读 · 2023年5月31日

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

Arxiv

0+阅读 · 2023年5月31日

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking

Arxiv

0+阅读 · 2023年5月31日

Efficient Training of Energy-Based Models Using Jarzynski Equality

Arxiv

0+阅读 · 2023年5月30日

A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation

Arxiv

0+阅读 · 2023年5月30日

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking

Arxiv

0+阅读 · 2023年5月30日

Coherent Soft Imitation Learning

Arxiv

0+阅读 · 2023年5月29日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

28+阅读 · 2021年6月16日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

相关基金

新型小分子64B靶向抑制眼脉络膜黑色素瘤肝转移的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

PI3K催化亚单位调控AKT活化影响非小细胞肺癌脑转移的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

液滴热毛细迁移的准定态假设适用性与稳定性研究

国家自然科学基金

0+阅读 · 2014年12月31日

HOXB-AS3/HOXB7/PAK4信号轴调控结直肠癌侵袭转移的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

组蛋白乙酰基转移酶PCAF通过乙酰化CDK4抑制胃癌增殖的研究

国家自然科学基金

0+阅读 · 2013年12月31日

选择性杀伤肺癌细胞的miRNA的筛选和功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

功率变换器非线性不稳定行为的washout滤波器控制方法

国家自然科学基金

0+阅读 · 2012年12月31日

拟Frobenius-Lusztig核

国家自然科学基金

0+阅读 · 2012年12月31日

肿瘤细胞中凋亡抑制蛋白CFLAR乙酰化调控的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

超过程及相关SPDE的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员