HAT5:使用文本到文本转换转换器识别仇恨语言 (HaT5: Hate Language Identification using Text-to-Text Transfer Transformer) - 专知论文

会员服务 ·

0

Performer · T5 · SOTA · MoDELS · 数据集 ·

2022 年 2 月 11 日

HaT5: Hate Language Identification using Text-to-Text Transfer Transformer

翻译：HAT5:使用文本到文本转换转换器识别仇恨语言

Sana Sabah Sabry,Tosin Adewumi,Nosheen Abid,György Kovacs,Foteini Liwicki,Marcus Liwicki

from arxiv, 7 pages, 3 figures , conference

We investigate the performance of a state-of-the art (SoTA) architecture T5 (available on the SuperGLUE) and compare with it 3 other previous SoTA architectures across 5 different tasks from 2 relatively diverse datasets. The datasets are diverse in terms of the number and types of tasks they have. To improve performance, we augment the training data by using an autoregressive model. We achieve near-SoTA results on a couple of the tasks - macro F1 scores of 81.66% for task A of the OLID 2019 dataset and 82.54% for task A of the hate speech and offensive content (HASOC) 2021 dataset, where SoTA are 82.9% and 83.05%, respectively. We perform error analysis and explain why one of the models (Bi-LSTM) makes the predictions it does by using a publicly available algorithm: Integrated Gradient (IG). This is because explainable artificial intelligence (XAI) is essential for earning the trust of users. The main contributions of this work are the implementation method of T5, which is discussed; the data augmentation using a new conversational AI model checkpoint, which brought performance improvements; and the revelation on the shortcomings of HASOC 2021 dataset. It reveals the difficulties of poor data annotation by using a small set of examples where the T5 model made the correct predictions, even when the ground truth of the test set were incorrect (in our opinion). We also provide our model checkpoints on the HuggingFace hub1 to foster transparency.

翻译：我们调查艺术( SoTA) 架构 T5 (可在 SuperGLUE上查阅) 的性能,并与它进行比较,从2个相对多样化的数据集的5个不同任务中,将前3个STA结构分为5个不同的任务。数据集在数量和任务类型上各不相同。为了改进性能,我们使用自动反向模型来增加培训数据。我们在几项任务上取得了接近SoTA的结果 — OLID 2019数据集A任务A的宏式F1分81.66%和任务A的82.54%(HASOC) 2021数据集 A(HSOC) 2021数据集 A, SoTA为82.9% 和83.05%。我们进行错误分析,并解释为什么其中一个模型(BI-LSTM) 使用公开可用的算法(综合梯度(GIG) ) 来进行预测。这是因为可以解释的人工智能(XAI) 对于获得用户的信任至关重要。这项工作的主要贡献是T5的执行方法, 正在讨论的还有T5 ; 数据模型增强数据模型, 使用新的对话的模型, 显示TRAIS II 的模型显示2021 的缺陷的模型显示的缺陷。

0

相关内容

Performer

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

320+阅读 · 2020年11月26日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

基于分层与或图模型的光学遥感图像场景理解方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

泛素交联酶UbcH7在DNA损伤应答过程中的功能和机制分析

国家自然科学基金

0+阅读 · 2015年12月31日

猴子PMd区神经集群对“伸-抓”动作的编码机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

电子商务平台的广告拍卖机制研究

国家自然科学基金

2+阅读 · 2013年12月31日

交互式图像搜索中的小样本学习问题研究

国家自然科学基金

1+阅读 · 2013年12月31日

ICC胃动素受体在红霉素促胃肠动力中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激在砷致神经细胞毒性中的作用机制及干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

半枝莲活性成分双向调节VEGF与DC机制新探索

国家自然科学基金

0+阅读 · 2008年12月31日

Detecting Text Formality: A Study of Text Classification Approaches

Detecting Text Formality: A Study of Text Classification Approaches

Arxiv

0+阅读 · 2022年4月19日

Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi

Arxiv

0+阅读 · 2022年4月19日

HFT-ONLSTM: Hierarchical and Fine-Tuning Multi-label Text Classification

Arxiv

0+阅读 · 2022年4月18日

Language Contamination Explains the Cross-lingual Capabilities of English Pretrained Models

Arxiv

0+阅读 · 2022年4月17日

SimpleBERT: A Pre-trained Model That Learns to Generate Simple Words

Arxiv

0+阅读 · 2022年4月16日

On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages

Arxiv

0+阅读 · 2022年4月5日

How Different are Pre-trained Transformers for Text Ranking?

Arxiv

0+阅读 · 2022年4月5日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

How to Fine-Tune BERT for Text Classification?

How to Fine-Tune BERT for Text Classification?

Arxiv

13+阅读 · 2019年5月14日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

VIP会员

文章信息

相关主题

相关VIP内容

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

320+阅读 · 2020年11月26日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACL2025教程】大语言模型的护栏与安全性：对其应用的安全、可靠与可控引导

《实现协同自主：从人机协作到多智能体系统》最新190页

【ICML2025】SToFM：一种用于空间转录组学的多尺度基础模型

通信网络智能体白皮书V1.0，61页pdf

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

NLP 2018 Highlights：2018自然语言处理技术亮点汇总

AINLP

10+阅读 · 2019年2月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Detecting Text Formality: A Study of Text Classification Approaches

Detecting Text Formality: A Study of Text Classification Approaches

Arxiv

0+阅读 · 2022年4月19日

Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi

Arxiv

0+阅读 · 2022年4月19日

HFT-ONLSTM: Hierarchical and Fine-Tuning Multi-label Text Classification

Arxiv

0+阅读 · 2022年4月18日

Language Contamination Explains the Cross-lingual Capabilities of English Pretrained Models

Arxiv

0+阅读 · 2022年4月17日

SimpleBERT: A Pre-trained Model That Learns to Generate Simple Words

Arxiv

0+阅读 · 2022年4月16日

On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages

Arxiv

0+阅读 · 2022年4月5日

How Different are Pre-trained Transformers for Text Ranking?

Arxiv

0+阅读 · 2022年4月5日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

How to Fine-Tune BERT for Text Classification?

How to Fine-Tune BERT for Text Classification?

Arxiv

13+阅读 · 2019年5月14日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

相关基金

基于分层与或图模型的光学遥感图像场景理解方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

泛素交联酶UbcH7在DNA损伤应答过程中的功能和机制分析

国家自然科学基金

0+阅读 · 2015年12月31日

猴子PMd区神经集群对“伸-抓”动作的编码机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

电子商务平台的广告拍卖机制研究

国家自然科学基金

2+阅读 · 2013年12月31日

交互式图像搜索中的小样本学习问题研究

国家自然科学基金

1+阅读 · 2013年12月31日

ICC胃动素受体在红霉素促胃肠动力中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激在砷致神经细胞毒性中的作用机制及干预研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

半枝莲活性成分双向调节VEGF与DC机制新探索

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员