探索自动生成反事实分析的功效 (Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis) - 专知论文

会员服务 ·

0

Performer · state-of-the-art · MoDELS · 可约的 · 情感分析 ·

2021 年 6 月 30 日

Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis

翻译：探索自动生成反事实分析的功效

Linyi Yang,Jiazheng Li,Pádraig Cunningham,Yue Zhang,Barry Smyth,Ruihai Dong

from arxiv, ACL-21, Main Conference, Long Paper

While state-of-the-art NLP models have been achieving the excellent performance of a wide range of tasks in recent years, important questions are being raised about their robustness and their underlying sensitivity to systematic biases that may exist in their training and test data. Such issues come to be manifest in performance problems when faced with out-of-distribution data in the field. One recent solution has been to use counterfactually augmented datasets in order to reduce any reliance on spurious patterns that may exist in the original data. Producing high-quality augmented data can be costly and time-consuming as it usually needs to involve human feedback and crowdsourcing efforts. In this work, we propose an alternative by describing and evaluating an approach to automatically generating counterfactual data for data augmentation and explanation. A comprehensive evaluation on several different datasets and using a variety of state-of-the-art benchmarks demonstrate how our approach can achieve significant improvements in model performance when compared to models training on the original data and even when compared to models trained with the benefit of human-generated augmented data.

翻译：近些年来,尽管最先进的国家劳工政策模型取得了出色地完成一系列广泛任务,但人们正在对其稳健性和对培训和测试数据中可能存在的系统偏差的潜在敏感性提出重要问题,这些问题表现在面对外地分配数据外的绩效问题中。最近的一个解决办法是使用反实际扩大的数据集,以减少对原始数据中可能存在的虚假模式的依赖。产生高质量的扩充数据可能费用高昂,耗费时间,因为通常需要人类反馈和众包努力。在这项工作中,我们提出一种替代办法,说明和评价自动生成反事实数据的方法,用于数据扩充和解释。对若干不同的数据集进行全面评价,并采用各种最新基准,表明与原始数据模型培训相比,我们的方法如何能够在模型业绩方面实现重大改进,即使与经过培训的模型相比,利用人类生成的扩大数据的好处。

0

相关内容

Performer

【EMNLP2020】自然语言生成，Neural Language Generation

【EMNLP2020】自然语言生成，Neural Language Generation

专知会员服务

39+阅读 · 2020年11月20日

【万字长文】注意力机制可解释大论述

专知会员服务

55+阅读 · 2020年11月17日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【CAAI 2019】XLNet and Beyond，杨植麟，联合创始人，循环智能（Recurrent AI）

【CAAI 2019】XLNet and Beyond，杨植麟，联合创始人，循环智能（Recurrent AI）

专知会员服务

14+阅读 · 2019年12月4日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

计算机 | IUI 2020等国际会议信息4条

计算机 | IUI 2020等国际会议信息4条

Call4Papers

6+阅读 · 2019年6月17日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Jointly Improving Summarization and Sentiment Classification

Jointly Improving Summarization and Sentiment Classification

黑龙江大学自然语言处理实验室

3+阅读 · 2018年6月12日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

已删除

将门创投

5+阅读 · 2018年2月28日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

原创 | Attention Modeling for Targeted Sentiment

原创 | Attention Modeling for Targeted Sentiment

黑龙江大学自然语言处理实验室

25+阅读 · 2017年11月5日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

Robust Retrieval Augmented Generation for Zero-shot Slot Filling

Robust Retrieval Augmented Generation for Zero-shot Slot Filling

Arxiv

0+阅读 · 2021年8月31日

A Sentiment Analysis Dataset for Trustworthiness Evaluation

A Sentiment Analysis Dataset for Trustworthiness Evaluation

Arxiv

0+阅读 · 2021年8月30日

Generating Fact Checking Explanations

Generating Fact Checking Explanations

Arxiv

9+阅读 · 2020年4月13日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Fine-grained Sentiment Analysis with Faithful Attention

Fine-grained Sentiment Analysis with Faithful Attention

Arxiv

5+阅读 · 2019年8月19日

Combination of Domain Knowledge and Deep Learning for Sentiment Analysis

Arxiv

3+阅读 · 2018年6月22日

Neural Models for Key Phrase Detection and Question Generation

Arxiv

4+阅读 · 2018年5月30日

Multimodal Sentiment Analysis To Explore the Structure of Emotions

Arxiv

19+阅读 · 2018年5月25日

$ρ$-hot Lexicon Embedding-based Two-level LSTM for Sentiment Analysis

Arxiv

6+阅读 · 2018年3月21日

SentiPers: A Sentiment Analysis Corpus for Persian

Arxiv

5+阅读 · 2018年1月23日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【EMNLP2020】自然语言生成，Neural Language Generation

【EMNLP2020】自然语言生成，Neural Language Generation

专知会员服务

39+阅读 · 2020年11月20日

【万字长文】注意力机制可解释大论述

专知会员服务

55+阅读 · 2020年11月17日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【CAAI 2019】XLNet and Beyond，杨植麟，联合创始人，循环智能（Recurrent AI）

【CAAI 2019】XLNet and Beyond，杨植麟，联合创始人，循环智能（Recurrent AI）

专知会员服务

14+阅读 · 2019年12月4日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

ExBert — 可视化分析Transformer学到的表示

ExBert — 可视化分析Transformer学到的表示

专知会员服务

32+阅读 · 2019年10月16日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

计算机 | IUI 2020等国际会议信息4条

计算机 | IUI 2020等国际会议信息4条

Call4Papers

6+阅读 · 2019年6月17日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Jointly Improving Summarization and Sentiment Classification

Jointly Improving Summarization and Sentiment Classification

黑龙江大学自然语言处理实验室

3+阅读 · 2018年6月12日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

已删除

将门创投

5+阅读 · 2018年2月28日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

原创 | Attention Modeling for Targeted Sentiment

原创 | Attention Modeling for Targeted Sentiment

黑龙江大学自然语言处理实验室

25+阅读 · 2017年11月5日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

Robust Retrieval Augmented Generation for Zero-shot Slot Filling

Robust Retrieval Augmented Generation for Zero-shot Slot Filling

Arxiv

0+阅读 · 2021年8月31日

A Sentiment Analysis Dataset for Trustworthiness Evaluation

A Sentiment Analysis Dataset for Trustworthiness Evaluation

Arxiv

0+阅读 · 2021年8月30日

Generating Fact Checking Explanations

Generating Fact Checking Explanations

Arxiv

9+阅读 · 2020年4月13日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Fine-grained Sentiment Analysis with Faithful Attention

Fine-grained Sentiment Analysis with Faithful Attention

Arxiv

5+阅读 · 2019年8月19日

Combination of Domain Knowledge and Deep Learning for Sentiment Analysis

Arxiv

3+阅读 · 2018年6月22日

Neural Models for Key Phrase Detection and Question Generation

Arxiv

4+阅读 · 2018年5月30日

Multimodal Sentiment Analysis To Explore the Structure of Emotions

Arxiv

19+阅读 · 2018年5月25日

$ρ$-hot Lexicon Embedding-based Two-level LSTM for Sentiment Analysis

Arxiv

6+阅读 · 2018年3月21日

SentiPers: A Sentiment Analysis Corpus for Persian

Arxiv

5+阅读 · 2018年1月23日

微信扫码咨询专知VIP会员