Automated Detection of Cyberbullying Against Women and Immigrants and Cross-domain Adaptability

Cyberbullying is a prevalent and growing social problem, driven by the surge in social media use. Minorities, women, and adolescents are among its common victims. Despite advances in NLP, automated cyberbullying detection remains challenging. This paper focuses on advancing detection using state-of-the-art NLP techniques. We use a Twitter dataset from SemEval 2019 Task 5 (HatEval) on hate speech against women and immigrants. Our best-performing ensemble model, based on DistilBERT, achieves F1 scores of 0.73 and 0.74 on classifying hate speech (Task A) and aggressiveness and target (Task B), respectively. We adapt the ensemble model developed for Task A to classify offensive language in external datasets and achieve an F1 score of ~0.7 on three benchmark datasets, a promising result for cross-domain adaptability. We conduct a qualitative analysis of misclassified tweets to provide recommendations for future cyberbullying research.
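To make the reported numbers concrete, the sketch below shows one common way an ensemble's binary predictions can be combined and scored. It is an illustration only: the abstract does not say how the ensemble members are combined or which F1 variant is reported, so majority voting and macro-averaged F1 are assumptions here, and the function names and toy labels are hypothetical.

```python
from collections import Counter
from typing import Sequence


def majority_vote(member_predictions: Sequence[Sequence[int]]) -> list[int]:
    """Combine per-model label lists into one label per example.

    Assumption: simple majority voting. With an odd number of members
    and binary labels there are no ties.
    """
    combined = []
    for votes in zip(*member_predictions):
        # most_common(1) returns the label with the highest vote count
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


def f1_for_label(y_true: Sequence[int], y_pred: Sequence[int], label: int) -> float:
    """F1 for one class, treating `label` as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == label and p != label)
    if tp == 0:
        return 0.0  # precision and recall are both zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(f1_for_label(y_true, y_pred, label) for label in labels) / len(labels)


# Toy data (hypothetical): 1 = hateful, 0 = not hateful.
gold = [1, 0, 1, 1, 0, 0]
model_a = [1, 0, 0, 1, 0, 1]
model_b = [1, 1, 1, 1, 0, 0]
model_c = [0, 0, 1, 1, 0, 1]

ensemble = majority_vote([model_a, model_b, model_c])
print("ensemble predictions:", ensemble)
print("macro F1:", round(macro_f1(gold, ensemble), 3))
```

In practice each member's predictions would come from a fine-tuned classifier rather than hand-written lists; the voting and scoring steps stay the same.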

