Continual learning protocols are attracting increasing attention from the medical imaging community. In continual environments, datasets acquired under different conditions arrive sequentially; and each is only available for a limited period of time. Given the inherent privacy risks associated with medical data, this setup reflects the reality of deployment for deep learning diagnostic radiology systems. Many techniques exist to learn continuously for image classification, and several have been adapted to semantic segmentation. Yet most struggle to accumulate knowledge in a meaningful manner. Instead, they focus on preventing the problem of catastrophic forgetting, even when this reduces model plasticity and thereon burdens the training process. This puts into question whether the additional overhead of knowledge preservation is worth it - particularly for medical image segmentation, where computation requirements are already high - or if maintaining separate models would be a better solution. We propose UNEG, a simple and widely applicable multi-model benchmark that maintains separate segmentation and autoencoder networks for each training stage. The autoencoder is built from the same architecture as the segmentation network, which in our case is a full-resolution nnU-Net, to bypass any additional design decisions. During inference, the reconstruction error is used to select the most appropriate segmenter for each test image. Open this concept, we develop a fair evaluation scheme for different continual learning settings that moves beyond the prevention of catastrophic forgetting. Our results across three regions of interest (prostate, hippocampus, and right ventricle) show that UNEG outperforms several continual learning methods, reinforcing the need for strong baselines in continual learning research.
翻译:持续学习的规程正在吸引医学成像界越来越多的关注。 在持续的环境中,在不同条件下获得的数据集会相继到达,每个数据集都是在有限的时间内提供的。 鉴于医疗数据固有的隐私风险,这种设置反映了深度学习诊断放射系统部署的现实。 许多技术存在,可以持续学习图像分类,有些技术已经适应语义分解。 多数人努力以有意义的方式积累知识。 相反,它们侧重于防止灾难性的遗忘问题, 即便这减少了模型的可塑性, 也给培训过程带来负担。 这就提出了知识保存的额外间接费用是否值得持续保存的问题, 特别是对于医学图像分割而言, 计算要求已经很高, 或者如果保持不同的模型将是一个更好的解决方案。 我们建议联合国评价小组, 一个简单和广泛适用的多模式基准, 维持每个培训阶段的分解和自动解码网络。 自动解析器是来自同一个架构, 也就是在我们的情况下, 一个完整解析的网络, 也就是一个完整解析的NU- Net, 以绕过任何更坚固的设计决定。 在推断过程中, 重建的模型将用来测试我们最公平的选择一个不同的图像, 。