This work introduces an attention mechanism for image classifiers and the corresponding deep neural network (DNN) architecture, dubbed ISNet. During training, the ISNet uses segmentation targets to learn how to find the image's region of interest and concentrate its attention on it. The proposal is based on a novel concept, background relevance minimization in LRP explanation heatmaps. It can be applied to virtually any classification neural network architecture, without any extra computational cost at run-time. Capable of ignoring the background, the resulting single DNN can substitute the common pipeline of a segmenter followed by a classifier, being faster and lighter. We tested the ISNet with three applications: COVID-19 and tuberculosis detection in chest X-rays, and facial attribute estimation. The first two tasks employed mixed training databases, which fostered background bias and shortcut learning. By focusing on lungs, the ISNet reduced shortcut learning, improving generalization to external (out-of-distribution) test datasets. When training data presented background bias, the ISNet's test performance significantly surpassed standard classifiers, multi-task DNNs (performing classification and segmentation), attention-gated neural networks, Guided Attention Inference Networks, and the standard segmentation-classification pipeline. Facial attribute estimation demonstrated that ISNet could precisely focus on faces, being also applicable to natural images. ISNet presents an accurate, fast, and light methodology to ignore backgrounds and improve generalization, especially when background bias is a concern.
翻译:这项工作为图像分类者和相应的深神经网络(DNN)结构引入了一种关注机制,称为ISNet。在培训期间,ISNet使用分割目标来学习如何找到图像感兴趣的区域并集中关注该区域。该提案基于一个新概念,即LRP解释型热图中背景相关性最小化,可以应用于任何神经网络结构分类,而无需在运行时增加任何额外的计算成本。如果能够忽略背景,由此产生的单一 DNNN可以取代一个分离器的共同管道,随后是一个分类器,速度更快,更轻。我们用三种应用测试ISNet:在胸前X光中进行COVID-19和肺结核检测,以及面部属性估计。前两个任务采用混合培训数据库,促进背景偏差和捷径学习。通过侧重于肺部,ISNet缩短学习,改进外部(分配外)测试性数据集的概括化。当培训数据显示背景偏差时,ISNet测试性业绩大大超过标准分类器,多任务背景点DNNNSs(表现的分类和偏差背景),特别是在胸X射线和面属性估计中, 跟踪和直角化的SIS-CIS-CRIalimalimalimation Flaislation Flaim