Neural networks are prone to catastrophic forgetting when trained incrementally on different tasks. In order to prevent forgetting, most existing methods retain a small subset of previously seen samples, which in turn can be used for joint training with new tasks. While this is indeed effective, it may not always be possible to store such samples, e.g., due to data protection regulations. In these cases, one can instead employ generative models to create artificial samples or features representing memories from previous tasks. Following a similar direction, we propose GenIFeR (Generative Implicit Feature Replay) for class-incremental learning. The main idea is to train a generative adversarial network (GAN) to generate images that contain realistic features. While the generator creates images at full resolution, the discriminator only sees the corresponding features extracted by the continually trained classifier. Since the classifier compresses raw images into features that are actually relevant for classification, the GAN can match this target distribution more accurately. On the other hand, allowing the generator to create full resolution images has several benefits: In contrast to previous approaches, the feature extractor of the classifier does not have to be frozen. In addition, we can employ augmentations on generated images, which not only boosts classification performance, but also mitigates discriminator overfitting during GAN training. We empirically show that GenIFeR is superior to both conventional generative image and feature replay. In particular, we significantly outperform the state-of-the-art in generative replay for various settings on the CIFAR-100 and CUB-200 datasets.
翻译:神经网络在接受不同任务培训时很容易被灾难性地遗忘。 为了防止忘记, 大多数现有方法都保留了一小部分先前看到的样本, 而这些样本可以用来进行与新任务有关的联合培训。 虽然这确实有效, 但由于数据保护条例等原因, 可能并不总是能够存储这些样本。 在这些情况下, 可以用基因化模型来创建人工样本或代表先前任务记忆的特征。 遵循一个类似的方向, 我们建议GenIFere( General Indictural Expecial Replay) 来进行课堂强化学习。 主要的想法是训练基因化的对立网络( GAN) 来生成包含现实特征的图像。 虽然这确实有效, 歧视者可能并不总是能够存储这些样本, 例如由于数据被持续培训的分类器将原始图像压缩成与常规任务相关的特征, GAN 能够更准确地匹配此目标分布。 在另一方面, 允许发电机创建完整的分辨率图像有几种好处 : 与以前的方法不同, 分类的特征提取器在100 的对基因结构的设置中, 并没有显著的升级, 而我们只能用GIRC 的特性来进行升级 。