Supervised Continual learning involves updating a deep neural network (DNN) from an ever-growing stream of labeled data. While most work has focused on overcoming catastrophic forgetting, one of the major motivations behind continual learning is being able to efficiently update a network with new information, rather than retraining from scratch on the training dataset as it grows over time. Despite recent continual learning methods largely solving the catastrophic forgetting problem, there has been little attention paid to the efficiency of these algorithms. Here, we study recent methods for incremental class learning and illustrate that many are highly inefficient in terms of compute, memory, and storage. Some methods even require more compute than training from scratch! We argue that for continual learning to have real-world applicability, the research community cannot ignore the resources used by these algorithms. There is more to continual learning than mitigating catastrophic forgetting.
翻译:监督连续学习涉及从不断增长的标注数据流中更新深度神经网络(DNN)。尽管大多数工作已经集中在克服灾难性遗忘上,但连续学习的主要动机之一是能够有效地更新网络,而不是在训练数据集不断增长时从头开始重新训练。尽管最近的连续学习方法在很大程度上解决了灾难性遗忘问题,但鲜有关注这些算法的效率。在这里,我们研究了最近的增量类学习方法,并说明其中许多方法在计算、内存和存储方面都非常低效。有些方法甚至需要比从头开始训练更多的计算资源!我们认为,为了使连续学习具有现实世界的适用性,研究界不能忽视这些算法所使用的资源。连续学习不仅仅是减轻灾难性遗忘问题。