This work tackles a central machine learning problem of performance degradation on out-of-distribution (OOD) test sets. The problem is particularly salient in medical imaging based diagnosis system that appears to be accurate but fails when tested in new hospitals/datasets. Recent studies indicate the system might learn shortcut and non-relevant features instead of generalizable features, so-called good features. We hypothesize that adversarial training can eliminate shortcut features whereas saliency guided training can filter out non-relevant features; both are nuisance features accounting for the performance degradation on OOD test sets. With that, we formulate a novel model training scheme for the deep neural network to learn good features for classification and/or detection tasks ensuring a consistent generalization performance on OOD test sets. The experimental results qualitatively and quantitatively demonstrate the superior performance of our method using the benchmark CXR image data sets on classification tasks.
翻译:这项工作解决了在分配外测试组中性能退化的中央机器学习问题,这个问题在基于医学成像的诊断系统中特别突出,这种诊断系统似乎准确,但在新的医院/数据集中测试时失败。最近的研究表明,该系统可以学习捷径和非相关特征,而不是一般特征,即所谓的好特征。我们假设对抗性培训可以消除捷径特征,而突出的引导性培训可以筛选出非相关特征;两者都是计算OOOD测试组性能退化的干扰特征。我们为此为深神经网络制定了一个新的示范培训计划,以学习分类和/或检测任务的良好特征,确保OOD测试组的一致通用性能。实验结果从质量和数量上用基准CXR图像组的分类任务来显示我们方法的优异性。