By learning a sequence of tasks continually, an agent in continual learning (CL) can improve the learning performance of both a new task and `old' tasks by leveraging the forward knowledge transfer and the backward knowledge transfer, respectively. However, most existing CL methods focus on addressing catastrophic forgetting in neural networks by minimizing the modification of the learnt model for old tasks. This inevitably limits the backward knowledge transfer from the new task to the old tasks, because judicious model updates could possibly improve the learning performance of the old tasks as well. To tackle this problem, we first theoretically analyze the conditions under which updating the learnt model of old tasks could be beneficial for CL and also lead to backward knowledge transfer, based on the gradient projection onto the input subspaces of old tasks. Building on the theoretical analysis, we next develop a ContinUal learning method with Backward knowlEdge tRansfer (CUBER), for a fixed capacity neural network without data replay. In particular, CUBER first characterizes the task correlation to identify the positively correlated old tasks in a layer-wise manner, and then selectively modifies the learnt model of the old tasks when learning the new task. Experimental studies show that CUBER can even achieve positive backward knowledge transfer on several existing CL benchmarks for the first time without data replay, where the related baselines still suffer from catastrophic forgetting (negative backward knowledge transfer). The superior performance of CUBER on the backward knowledge transfer also leads to higher accuracy accordingly.
翻译:通过不断学习一系列任务,持续学习的推动者(CL)可以通过利用前方知识转移和后向知识转移,提高新任务和“老”任务的学习业绩,但是,大多数现有的CL方法侧重于解决神经网络中的灾难性遗忘问题,尽量减少对旧任务所学模式的修改。这不可避免地限制了从新任务到旧任务的后向知识转移,因为明智的模型更新可能提高旧任务的学习业绩。为了解决这一问题,我们首先从理论上分析更新旧任务所学模式对CL有益,并导致后向知识转移的条件,其基础是向旧任务投入的子空间预测梯度。在理论分析的基础上,我们接下来开发一个连续学习方法,与后向知识模式的模型进行修改,以建立固定的能力神经网络,而没有数据重现。特别是,CURBER首先描述任务的相关性,以分层方式确定与旧任务正相关的旧任务,然后有选择地修改旧任务学习的后向后向知识转移模式,在学习新的CL任务后向后向基准时,实验性研究还显示CL的后向后向基准。