Recent progress in large-scale language models has enabled breakthroughs in previously intractable computer programming tasks. Prior work in meta-learning and neural architecture search has led to substantial successes across various task domains, spawning myriad approaches for algorithmically optimizing the design and learning dynamics of deep learning models. At the intersection of these research areas, we implement a code-generating language model with the ability to modify its own source code. Self-programming AI algorithms have been of interest since the dawn of AI itself. Although various theoretical formulations of generalized self-programming AI have been posed, no such system has been successfully implemented to date under real-world computational constraints. Applying AI-based code generation to AI itself, we develop and experimentally validate the first practical implementation of a self-programming AI system. We empirically show that a self-programming AI implemented using a code generation model can successfully modify its own source code to improve performance and program sub-models to perform auxiliary tasks. Our model can self-modify various properties including model architecture, computational capacity, and learning dynamics.
翻译:大规模语言模型的最近进展使得先前难以解决的计算机编程任务的突破得以实现; 元学习和神经结构的先前工作在各种任务领域取得了巨大成功,产生了从逻辑上优化深层学习模型的设计和学习动态的各种方法; 在这些研究领域的交叉点上,我们实施了一个代码生成语言模型,有能力修改自己的源代码; 自AI的算法自大赦国际成立以来就一直受到关注。 尽管已经提出了各种自编自编自编的理论公式,但迄今为止还没有在现实世界的计算限制下成功实施过这种系统。 将基于AI的代码生成应用到AI本身,我们开发和实验性地验证了自编自编自编的AI系统的首次实际实施。 我们的经验显示,使用代码生成模型实施的自编自编自编的AI能够成功地修改自己的源代码,以改进性能和编程子模型来完成辅助任务。 我们的模型可以自我调整各种特性,包括模型结构、计算能力和学习动态。