We formulate the continual learning (CL) problem via dynamic programming and model the trade-off between catastrophic forgetting and generalization as a two-player sequential game. In this approach, player 1 maximizes the cost due to lack of generalization whereas player 2 minimizes the cost due to catastrophic forgetting. We show theoretically that a balance point between the two players exists for each task and that this point is stable (once the balance is achieved, the two players stay at the balance point). Next, we introduce balanced continual learning (BCL), which is designed to attain balance between generalization and forgetting and empirically demonstrate that BCL is comparable to or better than the state of the art.
翻译:我们通过动态的编程和模型将灾难性的遗忘和笼统化之间的权衡作为双人相继游戏来制定持续学习问题。在这种方法中,玩家1因缺乏概括化而最大限度地增加成本,而玩家2则尽量减少灾难性遗忘的代价。我们从理论上表明,每个任务都存在两个玩家之间的平衡点,而且这个点是稳定的(一旦达到平衡,两个玩家就停留在平衡点 ) 。 其次,我们引入平衡的连续学习(BCL ), 目的是在概括化和遗忘之间实现平衡, 并用经验证明, BCL 与艺术水平相当或更好。