Current deep learning research is dominated by benchmark evaluation. A method is regarded as favorable if it empirically performs well on the dedicated test set. This mentality is seamlessly reflected in the resurfacing area of continual learning, where consecutively arriving sets of benchmark data are investigated. The core challenge is framed as protecting previously acquired representations from being catastrophically forgotten due to iterative parameter updates. However, the comparison of individual methods is treated in isolation from real world application and is typically judged by monitoring accumulated test set performance. The closed world assumption remains predominant: it is assumed that during deployment a model is guaranteed to encounter data that stems from the same distribution as its training data. This poses a massive challenge, as neural networks are well known to provide overconfident false predictions on unknown instances and to break down in the face of corrupted data. In this work we argue that notable lessons from open set recognition, the identification of statistically deviating data outside of the observed dataset, and the adjacent field of active learning, where data is incrementally queried such that the expected performance gain is maximized, are frequently overlooked in the deep learning era. Based on these forgotten lessons, we propose a consolidated view to bridge continual learning, active learning and open set recognition in deep neural networks. Our results show that this not only benefits each individual paradigm, but also highlights their natural synergies in a common framework. We empirically demonstrate improvements in alleviating catastrophic forgetting, querying data in active learning, and selecting task orders, while exhibiting robust open world application where previously proposed methods fail.
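To make the closed world failure mode concrete, the following is a minimal, self-contained sketch (not the method proposed in this work): a softmax classifier trained on two known classes still assigns near-certain predictions to inputs far outside its training distribution. All data, dimensions, and threshold values here are illustrative assumptions.

```python
# Illustrative sketch of closed world overconfidence on open set inputs.
# Assumptions: toy 2D Gaussian data, a linear softmax classifier, and an
# arbitrary confidence threshold; none of this reflects the paper's model.
import numpy as np

rng = np.random.default_rng(0)

# Two known classes: Gaussian clusters around (-2, 0) and (+2, 0).
X = np.vstack([rng.normal([-2.0, 0.0], 0.5, size=(200, 2)),
               rng.normal([+2.0, 0.0], 0.5, size=(200, 2))])
y = np.repeat([0, 1], 200)
onehot = np.eye(2)[y]

# Train a multinomial logistic (softmax) classifier by gradient descent.
W, b = np.zeros((2, 2)), np.zeros(2)
for _ in range(500):
    logits = X @ W + b
    p = np.exp(logits - logits.max(axis=1, keepdims=True))
    p /= p.sum(axis=1, keepdims=True)
    grad = (p - onehot) / len(X)  # cross-entropy gradient w.r.t. logits
    W -= X.T @ grad
    b -= grad.sum(axis=0)

def max_softmax(x):
    """Maximum softmax probability, i.e. the model's closed world confidence."""
    logits = x @ W + b
    p = np.exp(logits - logits.max(axis=1, keepdims=True))
    p /= p.sum(axis=1, keepdims=True)
    return p.max(axis=1)

# Open set inputs: statistically deviating points far from both clusters.
X_open = rng.normal([50.0, 50.0], 0.5, size=(200, 2))

print("mean confidence on known data:   ", max_softmax(X).mean())
print("mean confidence on open set data:", max_softmax(X_open).mean())

# Even a strict confidence threshold fails to reject these unknowns,
# because softmax confidence grows with distance from the decision boundary.
threshold = 0.95  # illustrative value
print("open set inputs rejected:", (max_softmax(X_open) < threshold).mean())
```

Running this prints near-certain confidence on both the known and the open set data, with essentially none of the unknown inputs rejected, which is precisely the overconfidence problem that motivates deviation-aware open set recognition rather than naive confidence thresholding.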