We present the results and the main findings of the NLP4IF-2021 shared tasks. Task 1 focused on fighting the COVID-19 infodemic in social media, and it was offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict whether that tweet contains a verifiable claim, and if so, whether it is likely to be false, is of general interest, is likely to be harmful, and is worthy of manual fact-checking; also, whether it is harmful to society, and whether it requires the attention of policy makers. Task~2 focused on censorship detection, and was offered in Chinese. A total of ten teams submitted systems for task 1, and one team participated in task 2; nine teams also submitted a system description paper. Here, we present the tasks, analyze the results, and discuss the system submissions and the methods they used. Most submissions achieved sizable improvements over several baselines, and the best systems used pre-trained Transformers and ensembles. The data, the scorers and the leaderboards for the tasks are available at http://gitlab.com/NLP4IF/nlp4if-2021.
翻译:我们介绍了NLP4IF-2021共同任务的结果和主要结论;任务1侧重于打击社交媒体中的COVID-19病毒,以阿拉伯文、保加利亚文和英文提供;任务1侧重于打击社交媒体中的COVID-19病毒;任务1,共有10个小组提交了任务1系统,一个小组参加了任务2;9个小组还提交了一份系统说明文件。在这里,我们介绍任务,分析结果,讨论系统呈件和使用的方法。大多数呈件在几个基线上取得了相当大的改进,并使用了经过预先训练的变压器和编模的最佳系统。数据、计分器和任务领导板见http://gitlab.com/NLP4IF/nlpif-2021。