A key challenge for AI is to build embodied systems that operate in dynamically changing environments. Such systems must adapt to changing task contexts and learn continuously. Although standard deep learning systems achieve state of the art results on static benchmarks, they often struggle in dynamic scenarios. In these settings, error signals from multiple contexts can interfere with one another, ultimately leading to a phenomenon known as catastrophic forgetting. In this article we investigate biologically inspired architectures as solutions to these problems. Specifically, we show that the biophysical properties of dendrites and local inhibitory systems enable networks to dynamically restrict and route information in a context-specific manner. Our key contributions are as follows. First, we propose a novel artificial neural network architecture that incorporates active dendrites and sparse representations into the standard deep learning framework. Next, we study the performance of this architecture on two separate benchmarks requiring task-based adaptation: Meta-World, a multi-task reinforcement learning environment where a robotic agent must learn to solve a variety of manipulation tasks simultaneously; and a continual learning benchmark in which the model's prediction task changes throughout training. Analysis on both benchmarks demonstrates the emergence of overlapping but distinct and sparse subnetworks, allowing the system to fluidly learn multiple tasks with minimal forgetting. Our neural implementation marks the first time a single architecture has achieved competitive results on both multi-task and continual learning settings. Our research sheds light on how biological properties of neurons can inform deep learning systems to address dynamic scenarios that are typically impossible for traditional ANNs to solve.
翻译:AI 面临的一个关键挑战是建立在动态变化的环境中运作的体现系统。这些系统必须适应不断变化的任务背景并不断学习。尽管标准的深层次学习系统在静态基准上取得了最新结果,但它们往往在动态情景中挣扎。在这些环境中,来自多种背景的错误信号可以相互干扰,最终导致一个被称为灾难性遗忘的现象。在本篇文章中,我们调查生物启发的建筑结构,作为解决这些问题的解决方案。具体地说,我们表明,德氏体和地方抑制系统的生物物理特性使网络能够根据具体情况对信息进行动态限制和选择。我们的主要贡献如下。首先,我们提出一个新的人工神经神经网络结构,在标准的深层次学习框架中包含积极的定型和稀薄的表述。接下来,我们根据两个不同的基准研究这一结构的绩效,需要基于任务的适应:Meta-World,一个多任务强化学习环境,机器人代理必须同时学习解决各种操纵任务;以及一个持续学习基准,模型在培训过程中预测传统任务的变化。我们的主要贡献如下:对两个基准都显示出现重叠,但罕见的神经神经网络结构结构的出现,让我们的连续的学习系统无法再学习。