检测和 Perturb:通过基于梯度的编码方式,中立地重写偏见和敏感文本 (Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based Decoding) - 专知论文

会员服务 ·

0

有偏 · 解码 · UniFormer · 均匀分布 · 生成模型 ·

2021 年 9 月 24 日

Detect and Perturb: Neutral Rewriting of Biased and Sensitive Text via Gradient-based Decoding

翻译：检测和 Perturb:通过基于梯度的编码方式,中立地重写偏见和敏感文本

Zexue He,Bodhisattwa Prasad Majumder,Julian McAuley

from arxiv, To appear at EMNLP-2021 as Findings

Written language carries explicit and implicit biases that can distract from meaningful signals. For example, letters of reference may describe male and female candidates differently, or their writing style may indirectly reveal demographic characteristics. At best, such biases distract from the meaningful content of the text; at worst they can lead to unfair outcomes. We investigate the challenge of re-generating input sentences to 'neutralize' sensitive attributes while maintaining the semantic meaning of the original text (e.g. is the candidate qualified?). We propose a gradient-based rewriting framework, Detect and Perturb to Neutralize (DEPEN), that first detects sensitive components and masks them for regeneration, then perturbs the generation model at decoding time under a neutralizing constraint that pushes the (predicted) distribution of sensitive attributes towards a uniform distribution. Our experiments in two different scenarios show that DEPEN can regenerate fluent alternatives that are neutral in the sensitive attribute while maintaining the semantics of other attributes.

翻译：书面文字含有明确和隐含的偏差,可以转移有意义的信号。例如,参考书可能以不同的方式描述男性和女性候选人,或者他们的写作风格可能间接揭示人口特征。充其量,这种偏差会分散对文本中有意义的内容的注意力;最坏的是,它们可能导致不公平的结果。我们调查了在保持原始文本的语义含义的同时,重新生成输入句“失效”敏感属性的挑战(例如,候选人有资格吗?)我们提议了一个基于梯度的重写框架,即检测和抄写以中立化(DEPEN),首先检测敏感成分并掩盖它们再生,然后在将敏感属性的(预先)分布推向统一分布的中性限制下,在解码时间干扰一代模式。我们在两种不同情况下的实验表明,DEPEN可以重新生成敏感属性中中性流的替代物,同时保持其他属性的语义。

0

相关内容

基于语言模型的预训练技术研究综述

专知会员服务

57+阅读 · 2021年10月12日

自然语言生成综述

专知会员服务

65+阅读 · 2021年5月29日

文本情感对话系统研究综述

专知会员服务

74+阅读 · 2021年5月21日

神经机器翻译前沿综述

专知会员服务

28+阅读 · 2020年9月9日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

【中科院计算所 | 文献综述】自然语言生成的无监督前训练:文献综述，Unsupervised Pre-training for Natural Language Generation: A Literature Review

【中科院计算所 | 文献综述】自然语言生成的无监督前训练:文献综述，Unsupervised Pre-training for Natural Language Generation: A Literature Review

专知会员服务

48+阅读 · 2019年11月15日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

【2019-26期】This Week in Extracellular Vesicles

【2019-26期】This Week in Extracellular Vesicles

外泌体之家

11+阅读 · 2019年6月28日

一文了解自然语言生成演变史！

一文了解自然语言生成演变史！

AI前线

5+阅读 · 2019年5月2日

Nature 一周论文导读 | 2019 年 4 月 4 日

Nature 一周论文导读 | 2019 年 4 月 4 日

科研圈

7+阅读 · 2019年4月14日

自然语言生成的演变史

自然语言生成的演变史

专知

25+阅读 · 2019年3月23日

已删除

将门创投

4+阅读 · 2018年11月20日

FastText的内部机制

FastText的内部机制

黑龙江大学自然语言处理实验室

5+阅读 · 2018年7月25日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Improving the robustness and accuracy of biomedical language models through adversarial training

Improving the robustness and accuracy of biomedical language models through adversarial training

Arxiv

0+阅读 · 2021年11月16日

SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets

Arxiv

0+阅读 · 2021年11月11日

A Survey on GANs for Anomaly Detection

A Survey on GANs for Anomaly Detection

Arxiv

7+阅读 · 2021年9月14日

Knowledge-based Review Generation by Coherence Enhanced Text Planning

Arxiv

7+阅读 · 2021年5月9日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Reverse Engineering Configurations of Neural Text Generation Models

Arxiv

5+阅读 · 2020年4月13日

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

Arxiv

3+阅读 · 2020年3月17日

Neural Models for Key Phrase Detection and Question Generation

Arxiv

4+阅读 · 2018年5月30日

Saliency-Enhanced Robust Visual Tracking

Arxiv

6+阅读 · 2018年2月8日

Memory Networks

Arxiv

3+阅读 · 2015年11月29日

VIP会员

文章信息

相关主题

相关VIP内容

基于语言模型的预训练技术研究综述

专知会员服务

57+阅读 · 2021年10月12日

自然语言生成综述

专知会员服务

65+阅读 · 2021年5月29日

文本情感对话系统研究综述

专知会员服务

74+阅读 · 2021年5月21日

神经机器翻译前沿综述

专知会员服务

28+阅读 · 2020年9月9日

【论文推荐】文本摘要简述

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

【中科院计算所 | 文献综述】自然语言生成的无监督前训练:文献综述，Unsupervised Pre-training for Natural Language Generation: A Literature Review

【中科院计算所 | 文献综述】自然语言生成的无监督前训练:文献综述，Unsupervised Pre-training for Natural Language Generation: A Literature Review

专知会员服务

48+阅读 · 2019年11月15日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

【2019-26期】This Week in Extracellular Vesicles

【2019-26期】This Week in Extracellular Vesicles

外泌体之家

11+阅读 · 2019年6月28日

一文了解自然语言生成演变史！

一文了解自然语言生成演变史！

AI前线

5+阅读 · 2019年5月2日

Nature 一周论文导读 | 2019 年 4 月 4 日

Nature 一周论文导读 | 2019 年 4 月 4 日

科研圈

7+阅读 · 2019年4月14日

自然语言生成的演变史

自然语言生成的演变史

专知

25+阅读 · 2019年3月23日

已删除

将门创投

4+阅读 · 2018年11月20日

FastText的内部机制

FastText的内部机制

黑龙江大学自然语言处理实验室

5+阅读 · 2018年7月25日

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

【论文推荐】最新5篇图像描述生成（Image Caption）相关论文—情感、注意力机制、遥感图像、序列到序列、深度神经结构

专知

66+阅读 · 2018年1月31日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Improving the robustness and accuracy of biomedical language models through adversarial training

Improving the robustness and accuracy of biomedical language models through adversarial training

Arxiv

0+阅读 · 2021年11月16日

SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets

Arxiv

0+阅读 · 2021年11月11日

A Survey on GANs for Anomaly Detection

A Survey on GANs for Anomaly Detection

Arxiv

7+阅读 · 2021年9月14日

Knowledge-based Review Generation by Coherence Enhanced Text Planning

Arxiv

7+阅读 · 2021年5月9日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Reverse Engineering Configurations of Neural Text Generation Models

Arxiv

5+阅读 · 2020年4月13日

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning

Arxiv

3+阅读 · 2020年3月17日

Neural Models for Key Phrase Detection and Question Generation

Arxiv

4+阅读 · 2018年5月30日

Saliency-Enhanced Robust Visual Tracking

Arxiv

6+阅读 · 2018年2月8日

Memory Networks

Arxiv

3+阅读 · 2015年11月29日

微信扫码咨询专知VIP会员