Probabilistic Impact Score Generation using Ktrain-BERT to Identify Hate Words from Twitter Discussions

November 25, 2021

Sourav Das, Prasanta Mandal, Sanjay Chatterji

From arXiv, 11 pages, 10 figures

Social media has seen a worrying rise in hate speech in recent times. Such derogatory content branches into several distinct categories, including cyberbullying, gender discrimination, and racism, and can collectively be labeled toxic content. This paper presents experiments with a lightweight, Keras-wrapped BERT model that identifies hate speech and predicts a probabilistic impact score for it, which is then used to extract the hateful words within sentences. The dataset used for this task is the English Hate Speech and Offensive Content Detection (HASOC 2021) data from FIRE 2021. Our system obtained a validation accuracy of 82.60%, with a maximum F1-score of 82.68%. Our predictive cases also performed well in generating impact scores that identify both the hate tweets and the hateful words within tweet pools.
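The abstract does not spell out how the impact score is computed, so the following is only an illustrative sketch and not the authors' method. It assumes one simple way to turn a sentence-level hate probability into word-level scores: remove each word in turn and measure how much the predicted probability drops. The scorer `toy_hate_probability` and its lexicon `TOY_WEIGHTS` are hypothetical stand-ins; in practice the callable would wrap a trained classifier's probability output.

```python
from typing import Callable, List, Tuple

# Hypothetical toy lexicon, used only so the sketch runs without a trained model.
TOY_WEIGHTS = {"idiots": 0.6, "hate": 0.3}


def toy_hate_probability(text: str) -> float:
    """Stand-in scorer (assumption): returns a pseudo-probability in [0, 1]."""
    score = 0.05 + sum(TOY_WEIGHTS.get(w.lower(), 0.0) for w in text.split())
    return min(score, 1.0)


def word_impact_scores(
    text: str, predict_proba: Callable[[str], float]
) -> List[Tuple[str, float]]:
    """Leave-one-word-out impact: the drop in hate probability when a word is removed."""
    words = text.split()
    base = predict_proba(text)
    scores = []
    for i, word in enumerate(words):
        # Rebuild the sentence without the i-th word and rescore it.
        occluded = " ".join(words[:i] + words[i + 1:])
        scores.append((word, base - predict_proba(occluded)))
    # Words with the largest probability drop come first.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for word, impact in word_impact_scores("I hate those idiots", toy_hate_probability):
        print(f"{word}: {impact:.2f}")
```

Any function that maps a string to a hate-class probability can be passed as `predict_proba`, so the toy scorer can be swapped for a real model without changing `word_impact_scores`.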

