Distribution shifts are a major source of failure of deployed machine learning models. However, evaluating a model's reliability under distribution shifts can be challenging, especially since it may be difficult to acquire counterfactual examples that exhibit a specified shift. In this work, we introduce dataset interfaces: a framework which allows users to scalably synthesize such counterfactual examples from a given dataset. Specifically, we represent each class from the input dataset as a custom token within the text space of a text-to-image diffusion model. By incorporating these tokens into natural language prompts, we can then generate instantiations of objects in that dataset under desired distribution shifts. We demonstrate how applying our framework to the ImageNet dataset enables us to study model behavior across a diverse array of shifts, including variations in background, lighting, and attributes of the objects themselves. Code available at https://github.com/MadryLab/dataset-interfaces.
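To make the workflow concrete, below is a minimal sketch of how a dataset interface of this kind could be used: a per-class token embedding (e.g., learned via textual inversion) is bound to a custom token in a text-to-image diffusion pipeline and composed with natural-language prompts describing the desired shift. The file path, token name, and prompt are hypothetical placeholders, and the example uses the Hugging Face diffusers API as an illustration rather than the repository's own interface.

```python
# Minimal sketch: generate counterfactual examples for one class under a
# specified distribution shift, assuming a per-class token embedding has
# already been learned (e.g., via textual inversion) and saved to disk.
# The embedding path, token name, and prompt below are illustrative only.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Bind the learned class embedding to a custom token in the text encoder.
pipe.load_textual_inversion(
    "embeddings/golden_retriever.bin", token="<golden-retriever>"
)

# Compose the class token with natural language describing the shift
# (here: a background and lighting change).
prompt = "a photo of a <golden-retriever> in the snow at night"
images = pipe(prompt, num_images_per_prompt=4).images

for i, img in enumerate(images):
    img.save(f"counterfactual_{i}.png")
```

The generated images can then be fed to the model under evaluation to measure how its accuracy on this class degrades under the specified shift.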