HERDPhobia:尼日利亚针对富拉尼人的仇恨言论数据集 (HERDPhobia: A Dataset for Hate Speech against Fulani in Nigeria) - 专知论文

会员服务 ·

0

数据集 · MoDELS · HTTPS · Performer · Feel ·

2022 年 11 月 28 日

HERDPhobia: A Dataset for Hate Speech against Fulani in Nigeria

翻译：HERDPhobia:尼日利亚针对富拉尼人的仇恨言论数据集

Saminu Mohammad Aliyu,Gregory Maksha Wajiga,Muhammad Murtala,Shamsuddeen Hassan Muhammad,Idris Abdulmumin,Ibrahim Said Ahmad

from arxiv, To appear in the Proceedings of the Sixth Workshop on Widening Natural Language Processing at EMNLP2022

Social media platforms allow users to freely share their opinions about issues or anything they feel like. However, they also make it easier to spread hate and abusive content. The Fulani ethnic group has been the victim of this unfortunate phenomenon. This paper introduces the HERDPhobia - the first annotated hate speech dataset on Fulani herders in Nigeria - in three languages: English, Nigerian-Pidgin, and Hausa. We present a benchmark experiment using pre-trained languages models to classify the tweets as either hateful or non-hateful. Our experiment shows that the XML-T model provides better performance with 99.83% weighted F1. We released the dataset at https://github.com/hausanlp/HERDPhobia for further research.

翻译：社交媒体平台允许用户自由分享自己对问题或任何他们喜欢的东西的看法。然而, 他们也更容易传播仇恨和虐待内容。富拉尼族群体是这一不幸现象的受害者。本文介绍了HERDPhobia, 这是尼日利亚Fulani牧民首个附加注释的仇恨言论数据集, 使用三种语言: 英文、尼日利亚语- Pidgin 和 Hausa。我们提出了一个基准实验, 使用预先培训的语言模式将推特归类为仇恨或不恨。我们的实验显示, XML- T 模式提供了99.83%加权F1的更好表现。我们在 https://github.com/hausanp/ HehrDPhobia 上发布了数据集, 供进一步研究。

0

相关内容

数据集

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【数据集】新的YELP数据集官方下载

【数据集】新的YELP数据集官方下载

机器学习研究会

16+阅读 · 2017年8月31日

长链非编码RNA HOXD-AS1促进人肝细胞癌增殖的作用及分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

高良姜根茎抗肿瘤活性成分的筛选、结构修饰及其构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

REVOLUTA基因在豆科模式植物蒺藜苜蓿复叶发育中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

适用于无线传感器网络SOC的低功耗低成本SAR型A/D转换器设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

三氧化二砷降解HER2蛋白的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

4f和3d电子调控下的新型In和Te基稀土1：3型半导体化合物的磁输运和结构

国家自然科学基金

0+阅读 · 2012年12月31日

OsDCL3b基因调控稻穗生长发育的遗传机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Zero-Shot Rumor Detection with Propagation Structure via Prompt Learning

Arxiv

0+阅读 · 2023年1月30日

Visual Belief Elicitation Reduces the Incidence of False Discovery

Arxiv

0+阅读 · 2023年1月29日

Data Origin Inference in Machine Learning

Arxiv

0+阅读 · 2023年1月29日

How to Make Users Adopt More Sustainable Cryptocurrencies: Evidence from Nigeria

Arxiv

0+阅读 · 2023年1月27日

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Arxiv

0+阅读 · 2023年1月27日

Nik Defense: An Artificial Intelligence Based Defense Mechanism against Selfish Mining in Bitcoin

Arxiv

0+阅读 · 2023年1月26日

Explaining Patterns in Data with Language Models via Interpretable Autoprompting

Arxiv

0+阅读 · 2023年1月26日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

27+阅读 · 2021年6月16日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《生成式人工智能与大/小语言模型在供应链管理决策优化与可持续性提升中的作用评估》最新51页

白宫发布《赢得AI竞赛：美国人工智能行动计划》最新28页

地下战：地下空间的战略博弈

《美地下作战条令手册》228页

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【数据集】新的YELP数据集官方下载

【数据集】新的YELP数据集官方下载

机器学习研究会

16+阅读 · 2017年8月31日

相关论文

Zero-Shot Rumor Detection with Propagation Structure via Prompt Learning

Arxiv

0+阅读 · 2023年1月30日

Visual Belief Elicitation Reduces the Incidence of False Discovery

Arxiv

0+阅读 · 2023年1月29日

Data Origin Inference in Machine Learning

Arxiv

0+阅读 · 2023年1月29日

How to Make Users Adopt More Sustainable Cryptocurrencies: Evidence from Nigeria

Arxiv

0+阅读 · 2023年1月27日

Theoretical Analysis of Offline Imitation With Supplementary Dataset

Arxiv

0+阅读 · 2023年1月27日

Nik Defense: An Artificial Intelligence Based Defense Mechanism against Selfish Mining in Bitcoin

Arxiv

0+阅读 · 2023年1月26日

Explaining Patterns in Data with Language Models via Interpretable Autoprompting

Arxiv

0+阅读 · 2023年1月26日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Arxiv

27+阅读 · 2021年6月16日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

相关基金

长链非编码RNA HOXD-AS1促进人肝细胞癌增殖的作用及分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

抑制Hedgehog信号通路的植物C21甾体化合物的构效关系、结构优化及抗肿瘤作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

高良姜根茎抗肿瘤活性成分的筛选、结构修饰及其构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

REVOLUTA基因在豆科模式植物蒺藜苜蓿复叶发育中的功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

适用于无线传感器网络SOC的低功耗低成本SAR型A/D转换器设计研究

国家自然科学基金

0+阅读 · 2013年12月31日

三氧化二砷降解HER2蛋白的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Intraflagellar Transport运输纤毛蛋白的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

4f和3d电子调控下的新型In和Te基稀土1：3型半导体化合物的磁输运和结构

国家自然科学基金

0+阅读 · 2012年12月31日

OsDCL3b基因调控稻穗生长发育的遗传机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员