Research on open-domain dialog systems has been greatly advanced by neural models trained on large-scale corpora; however, such corpora often introduce various safety problems (e.g., offensive language, biases, and toxic behaviors) that significantly hinder the deployment of dialog systems in practice. Among these safety issues, social bias is particularly complex to address, as its negative impact on marginalized populations is usually expressed implicitly and thus requires normative reasoning and rigorous analysis. In this paper, we focus our investigation on social bias detection as a dialog safety problem. We first propose a novel Dial-Bias Frame for analyzing social bias in conversations pragmatically, which supports more comprehensive bias-related analyses than simple dichotomous annotations. Based on the proposed framework, we further introduce the CDail-Bias Dataset, which is, to our knowledge, the first well-annotated Chinese social bias dialog dataset. In addition, we establish several dialog bias detection benchmarks at different label granularities and input types (utterance-level and context-level). We show that the in-depth analyses in our Dial-Bias Frame, together with these benchmarks, are essential to bias detection tasks and can benefit the construction of safe dialog systems in practice.