Given the recent impressive accomplishments of language models (LMs) for code generation, we explore the use of LMs as adaptive mutation and crossover operators for an evolutionary neural architecture search (NAS) algorithm. While NAS still proves too difficult a task for LMs to succeed at solely through prompting, we find that the combination of evolutionary prompt engineering with soft prompt-tuning, a method we term EvoPrompting, consistently finds diverse and high-performing models. We first demonstrate that EvoPrompting is effective on the computationally efficient MNIST-1D dataset, where it produces convolutional architecture variants that outperform, in both accuracy and model size, those designed by human experts as well as those produced by naive few-shot prompting. We then apply our method to searching for graph neural networks on the CLRS Algorithmic Reasoning Benchmark, where EvoPrompting designs novel architectures that outperform current state-of-the-art models on 21 out of 30 algorithmic reasoning tasks while maintaining similar model size. EvoPrompting is successful at designing accurate and efficient neural network architectures across a variety of machine learning tasks, while also being general enough for easy adaptation to other tasks beyond neural network design.
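For concreteness, the following is a minimal Python sketch of the evolutionary loop the abstract describes, under stated assumptions: the callables `generate_children`, `train_and_evaluate`, and `prompt_tune` are hypothetical placeholders standing in for the paper's LM prompting, child-model training, and soft prompt-tuning machinery, and the fitness trade-off between accuracy and model size is illustrative rather than the paper's exact objective.

```python
import random
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Candidate:
    code: str        # source code of a candidate architecture
    accuracy: float  # validation accuracy after training
    n_params: int    # parameter count (model size)


def fitness(c: Candidate) -> float:
    # Reward accuracy while penalizing model size; this trade-off is
    # illustrative and may differ from the paper's actual objective.
    return c.accuracy - 1e-8 * c.n_params


def evoprompt(
    seed_pool: List[Candidate],
    generate_children: Callable[[List[Candidate]], List[str]],  # LM crossover/mutation (assumed)
    train_and_evaluate: Callable[[str], Candidate],             # trains a child, returns metrics (assumed)
    prompt_tune: Callable[[List[Candidate]], None],             # soft prompt-tunes the LM (assumed)
    rounds: int = 10,
    parents_per_prompt: int = 2,
    children_per_round: int = 16,
    pool_size: int = 20,
) -> List[Candidate]:
    pool = list(seed_pool)
    for _ in range(rounds):
        # Mutation/crossover: few-shot prompt the LM with sampled parent programs.
        child_codes: List[str] = []
        for _ in range(children_per_round):
            parents = random.sample(pool, min(parents_per_prompt, len(pool)))
            child_codes.extend(generate_children(parents))
        # Evaluate each child architecture by training it on the target task.
        evaluated = [train_and_evaluate(code) for code in child_codes]
        # Selection: keep the fittest candidates as the next generation's pool.
        pool = sorted(pool + evaluated, key=fitness, reverse=True)[:pool_size]
        # Adapt the LM toward successful designs via soft prompt-tuning.
        prompt_tune(pool)
    return pool
```

In this sketch, the LM acts as both crossover (it conditions on multiple parent programs in the prompt) and mutation (its stochastic decoding perturbs the children), while soft prompt-tuning closes the loop by steering the LM toward high-fitness regions of the search space.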