We explore the extent to which zero-shot vision-language models exhibit gender bias across different vision tasks. Vision models traditionally required task-specific labels and finetuning to represent concepts; zero-shot models such as CLIP instead perform tasks with an open vocabulary, using text embeddings to represent concepts rather than a fixed set of labels. With these capabilities in mind, we ask: Do vision-language models exhibit gender bias when performing zero-shot image classification, object detection, and semantic segmentation? We evaluate several vision-language models on multiple datasets across a set of concepts and find that (i) every model evaluated shows distinct performance differences depending on the perceived gender of the person co-occurring with a given concept in the image, and aggregating analyses over all concepts can mask these disparities; (ii) model calibration (i.e., the relationship between accuracy and confidence) also differs distinctly by perceived gender, even when evaluating similar representations of a concept; and (iii) the observed disparities align with existing gender biases in word embeddings from language models. These findings suggest that, while language greatly expands the capabilities of vision models, it can also introduce social biases into zero-shot vision settings. Moreover, these biases can propagate when foundation models like CLIP are used by other models to enable zero-shot capabilities.
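To make the zero-shot setup concrete, below is a minimal sketch of open-vocabulary image classification with a pretrained CLIP model via the Hugging Face transformers API. The checkpoint name, prompts, and image path are illustrative assumptions, not the exact configuration evaluated in this work.

```python
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Load a pretrained CLIP checkpoint (example checkpoint, not necessarily
# the one studied in the paper).
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# Open-vocabulary classification: the label set is just a list of text
# prompts, so no task-specific finetuning is required.
labels = ["a photo of a doctor", "a photo of a nurse", "a photo of a teacher"]
image = Image.open("example.jpg")  # hypothetical input image

inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)

# logits_per_image holds image-text similarity scores; softmax over the
# candidate prompts gives the zero-shot prediction probabilities.
probs = outputs.logits_per_image.softmax(dim=-1)
print(dict(zip(labels, probs[0].tolist())))
```

Because the label set is defined entirely by text prompts, any gender associations encoded in the text embeddings can directly influence the zero-shot predictions, which is the mechanism this work probes.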