Deep neural networks struggle to continually learn multiple sequential tasks due to catastrophic forgetting of previously learned tasks. Rehearsal-based methods which explicitly store previous task samples in the buffer and interleave them with the current task samples have proven to be the most effective in mitigating forgetting. However, Experience Replay (ER) does not perform well under low-buffer regimes and longer task sequences as its performance is commensurate with the buffer size. Consistency in predictions of soft-targets can assist ER in preserving information pertaining to previous tasks better as soft-targets capture the rich similarity structure of the data. Therefore, we examine the role of consistency regularization in ER framework under various continual learning scenarios. We also propose to cast consistency regularization as a self-supervised pretext task thereby enabling the use of a wide variety of self-supervised learning methods as regularizers. While simultaneously enhancing model calibration and robustness to natural corruptions, regularizing consistency in predictions results in lesser forgetting across all continual learning scenarios. Among the different families of regularizers, we find that stricter consistency constraints preserve previous task information in ER better.
翻译:深神经网络努力不断学习由于灾难性地忘记以往学到的任务而导致的多重连续任务。 以排练为基础的方法明确将先前的任务样品储存在缓冲中,并将它们与当前的任务样品隔开来,这已证明是减轻忘却的最有效办法。 但是,在低缓冲制度和较长的任务序列下,经验重放(ER)效果不佳,因为其性能与缓冲规模相称。 软目标预测的一致性有助于ER更好地保存与以前任务有关的信息,因为软目标捕捉了数据丰富的相似结构。 因此,我们审视了在各种持续学习情景下在ER框架中一致性正规化的作用。我们还建议将一致性正规化作为一种自我监督的托辞任务,从而使得能够使用多种自我监督的学习方法作为规范者。与此同时,在加强模型校准和对自然腐败的稳健度的同时,使预测的一致性导致在所有持续学习情景中减少遗忘。 在不同的规范者中,我们发现更加严格的一致性制约将先前的任务信息保存在ER的更好。