State-of-the-art machine learning models often learn spurious correlations embedded in the training data. This poses risks when deploying these models for high-stakes decision-making, such as in medical applications like skin cancer detection. To tackle this problem, we propose Reveal to Revise (R2R), a framework encompassing the entire eXplainable Artificial Intelligence (XAI) life cycle, enabling practitioners to iteratively identify, mitigate, and (re-)evaluate spurious model behavior with a minimal amount of human interaction. In the first step (1), R2R reveals model weaknesses by finding outliers in attributions or through inspection of latent concepts learned by the model. Second (2), the responsible artifacts are detected and spatially localized in the input data, which is then leveraged to (3) revise the model behavior. Concretely, we apply the methods of RRR (Right for the Right Reasons), CDEP (Contextual Decomposition Explanation Penalization), and ClArC (Class Artifact Compensation) for model correction, and (4) (re-)evaluate the model's performance and its remaining sensitivity towards the artifact. Using two medical benchmark datasets for melanoma detection and bone age estimation, we apply our R2R framework to VGG, ResNet, and EfficientNet architectures, and thereby reveal and correct real dataset-intrinsic artifacts, as well as synthetic variants in a controlled setting. Completing the XAI life cycle, we demonstrate multiple R2R iterations to mitigate different biases. Code is available at https://github.com/maxdreyer/Reveal2Revise.
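
To illustrate the kind of model correction applied in step (3), the sketch below shows a minimal PyTorch implementation of the RRR penalty (Ross et al., 2017): the task loss is augmented with the squared input gradients of the log-probabilities inside a human-provided artifact mask, discouraging the model from using the artifact region. The function name `rrr_loss`, the mask convention, and the weighting parameter `lambda_rrr` are illustrative assumptions, not identifiers from the R2R code base.

```python
import torch
import torch.nn.functional as F

def rrr_loss(model, x, y, artifact_mask, lambda_rrr=1.0):
    """Right for the Right Reasons (RRR) loss: cross-entropy plus a
    penalty on input gradients of the log-probabilities within the
    annotated artifact region (artifact_mask == 1).

    A minimal sketch; names and the fixed weighting are assumptions,
    not part of the official Reveal2Revise implementation.
    """
    x = x.clone().requires_grad_(True)
    logits = model(x)
    task_loss = F.cross_entropy(logits, y)

    # Gradient of the summed log-probabilities w.r.t. the input,
    # kept in the graph (create_graph=True) so the penalty itself
    # is differentiable and can be trained through.
    log_probs = F.log_softmax(logits, dim=1)
    input_grads, = torch.autograd.grad(
        log_probs.sum(), x, create_graph=True
    )

    # Penalize explanation mass that falls on the artifact region.
    right_reasons = (artifact_mask * input_grads).pow(2).sum()

    return task_loss + lambda_rrr * right_reasons
```

In an R2R loop, the artifact mask would come from the spatial localization in step (2), e.g. thresholded heatmaps of the identified artifact concept, and the corrected model would then be re-evaluated for remaining artifact sensitivity in step (4).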