跨语言COVID-19假消息探测 (Cross-lingual COVID-19 Fake News Detection) - 专知论文

会员服务 ·

0

COVID-19 · 数据集 · INFORMS · HTTPS · 深度学习框架 ·

2021 年 10 月 13 日

Cross-lingual COVID-19 Fake News Detection

翻译：跨语言COVID-19假消息探测

Jiangshu Du,Yingtong Dou,Congying Xia,Limeng Cui,Jing Ma,Philip S. Yu

from arxiv, Accepted by SDM at ICDM, data is available at https://github.com/YingtongDou/CrossFake

The COVID-19 pandemic poses a great threat to global public health. Meanwhile, there is massive misinformation associated with the pandemic which advocates unfounded or unscientific claims. Even major social media and news outlets have made an extra effort in debunking COVID-19 misinformation, most of the fact-checking information is in English, whereas some unmoderated COVID-19 misinformation is still circulating in other languages, threatening the health of less-informed people in immigrant communities and developing countries. In this paper, we make the first attempt to detect COVID-19 misinformation in a low-resource language (Chinese) only using the fact-checked news in a high-resource language (English). We start by curating a Chinese real&fake news dataset according to existing fact-checking information. Then, we propose a deep learning framework named CrossFake to jointly encode the cross-lingual news body texts and capture the news content as much as possible. Empirical results on our dataset demonstrate the effectiveness of CorssFake under the cross-lingual setting and it also outperforms several monolingual and cross-lingual fake news detectors. The dataset is available at https://github.com/YingtongDou/CrossFake.

翻译：COVID-19大流行给全球公众健康带来巨大威胁。与此同时,与这一大流行有关的大量错误信息与这种大流行有关,鼓吹毫无根据或不科学的说法。即使是主要的社交媒体和新闻媒体也作出额外努力,破除COVID-19错误信息,大部分事实核对信息是英文,而一些未更新的COVID-19大流行信息仍然以其他语言传播,威胁移民社区和发展中国家信息不全的人的健康。在本文中,我们第一次尝试用一种低资源语言(中文)来检测COVID-19错误信息,但只能使用高资源语言(英文)的经查实的新闻。我们首先根据现有的事实核对信息整理中国真实和假新闻数据集。然后,我们提出一个名为Crosfake的深层次学习框架,以联合编码跨语言新闻机构文本并尽可能地捕捉新闻内容。我们的数据集的“经验”显示跨语言设置下的CorsFake的有效性,它也超越了几个单语和交叉语言的假新闻探测器。数据可在 https://Yging/Dsington。

0

相关内容

COVID-19

自然语言处理顶会EMNLP2020接受论文列表，754篇论文都在这儿了！

自然语言处理顶会EMNLP2020接受论文列表，754篇论文都在这儿了！

专知会员服务

28+阅读 · 2020年10月26日

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

专知会员服务

41+阅读 · 2020年7月14日

COVID-19文献知识图谱构建，UIUC-哥伦比亚大学

COVID-19文献知识图谱构建，UIUC-哥伦比亚大学

专知会员服务

43+阅读 · 2020年7月2日

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

专知会员服务

79+阅读 · 2020年2月12日

【哥伦比亚大学应用机器学习课程2020】《COMS W4995 Applied Machine Learning Spring 2020》

【哥伦比亚大学应用机器学习课程2020】《COMS W4995 Applied Machine Learning Spring 2020》

专知会员服务

26+阅读 · 2020年1月23日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

计算机 | 国际会议信息5条

计算机 | 国际会议信息5条

Call4Papers

3+阅读 · 2019年7月3日

计算机 | EMNLP 2019等国际会议信息6条

计算机 | EMNLP 2019等国际会议信息6条

Call4Papers

18+阅读 · 2019年4月26日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

计算机 | ISMAR 2019等国际会议信息8条

计算机 | ISMAR 2019等国际会议信息8条

Call4Papers

3+阅读 · 2019年3月5日

计算机类 | SIGMETRICS 2019等国际会议信息7条

计算机类 | SIGMETRICS 2019等国际会议信息7条

Call4Papers

9+阅读 · 2018年10月23日

计算机 | CCF推荐会议信息10条

计算机 | CCF推荐会议信息10条

Call4Papers

5+阅读 · 2018年10月18日

【论文推荐】最新六篇聊天机器人相关论文—弱监督信息、内容驱动、对话管理系统、可扩展情感序列到序列、自主性

【论文推荐】最新六篇聊天机器人相关论文—弱监督信息、内容驱动、对话管理系统、可扩展情感序列到序列、自主性

专知

9+阅读 · 2018年5月12日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

【今日新增】计算机领域国际会议截稿信息

【今日新增】计算机领域国际会议截稿信息

Call4Papers

9+阅读 · 2017年7月21日

Multimodal Emergent Fake News Detection via Meta Neural Process Networks

Arxiv

6+阅读 · 2021年6月22日

Mining Dual Emotion for Fake News Detection

Arxiv

13+阅读 · 2020年10月19日

Few-shot Scene-adaptive Anomaly Detection

Few-shot Scene-adaptive Anomaly Detection

Arxiv

8+阅读 · 2020年7月15日

COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest Radiography Images

Arxiv

6+阅读 · 2020年3月22日

Imbalance Problems in Object Detection: A Review

Arxiv

25+阅读 · 2020年3月11日

Credibility-based Fake News Detection

Credibility-based Fake News Detection

Arxiv

3+阅读 · 2019年11月2日

Deep Learning for Deepfakes Creation and Detection

Deep Learning for Deepfakes Creation and Detection

Arxiv

6+阅读 · 2019年9月25日

Object Detection in 20 Years: A Survey

Object Detection in 20 Years: A Survey

Arxiv

48+阅读 · 2019年5月13日

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity

Arxiv

3+阅读 · 2018年1月19日

Fluency-Guided Cross-Lingual Image Captioning

Arxiv

3+阅读 · 2017年8月15日

VIP会员

文章信息

相关主题

深度学习框架

相关VIP内容

自然语言处理顶会EMNLP2020接受论文列表，754篇论文都在这儿了！

自然语言处理顶会EMNLP2020接受论文列表，754篇论文都在这儿了！

专知会员服务

28+阅读 · 2020年10月26日

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

【论文】持续学习的图神经网络用于检测社交媒体的假新闻，Graph Neural Networks with Continual Learning for Fake News Detection from Social Media

专知会员服务

41+阅读 · 2020年7月14日

COVID-19文献知识图谱构建，UIUC-哥伦比亚大学

COVID-19文献知识图谱构建，UIUC-哥伦比亚大学

专知会员服务

43+阅读 · 2020年7月2日

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

专知会员服务

79+阅读 · 2020年2月12日

【哥伦比亚大学应用机器学习课程2020】《COMS W4995 Applied Machine Learning Spring 2020》

【哥伦比亚大学应用机器学习课程2020】《COMS W4995 Applied Machine Learning Spring 2020》

专知会员服务

26+阅读 · 2020年1月23日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

自动驾驶轨迹规划中的基础模型：进展综述与开放挑战

《用于提升多域战备的大型语言模型辅助场景生成器》报告

【斯坦福博士论文】为人类使用优化 AI 模型

国防领域人工智能规模化应用的理论与实践

相关资讯

计算机 | 国际会议信息5条

计算机 | 国际会议信息5条

Call4Papers

3+阅读 · 2019年7月3日

计算机 | EMNLP 2019等国际会议信息6条

计算机 | EMNLP 2019等国际会议信息6条

Call4Papers

18+阅读 · 2019年4月26日

计算机 | USENIX Security 2020等国际会议信息5条

计算机 | USENIX Security 2020等国际会议信息5条

Call4Papers

7+阅读 · 2019年4月25日

计算机 | ISMAR 2019等国际会议信息8条

计算机 | ISMAR 2019等国际会议信息8条

Call4Papers

3+阅读 · 2019年3月5日

计算机类 | SIGMETRICS 2019等国际会议信息7条

计算机类 | SIGMETRICS 2019等国际会议信息7条

Call4Papers

9+阅读 · 2018年10月23日

计算机 | CCF推荐会议信息10条

计算机 | CCF推荐会议信息10条

Call4Papers

5+阅读 · 2018年10月18日

【论文推荐】最新六篇聊天机器人相关论文—弱监督信息、内容驱动、对话管理系统、可扩展情感序列到序列、自主性

【论文推荐】最新六篇聊天机器人相关论文—弱监督信息、内容驱动、对话管理系统、可扩展情感序列到序列、自主性

专知

9+阅读 · 2018年5月12日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机类 | 国际会议信息7条

计算机类 | 国际会议信息7条

Call4Papers

3+阅读 · 2017年11月17日

【今日新增】计算机领域国际会议截稿信息

【今日新增】计算机领域国际会议截稿信息

Call4Papers

9+阅读 · 2017年7月21日

相关论文

Multimodal Emergent Fake News Detection via Meta Neural Process Networks

Arxiv

6+阅读 · 2021年6月22日

Mining Dual Emotion for Fake News Detection

Arxiv

13+阅读 · 2020年10月19日

Few-shot Scene-adaptive Anomaly Detection

Few-shot Scene-adaptive Anomaly Detection

Arxiv

8+阅读 · 2020年7月15日

COVID-Net: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest Radiography Images

Arxiv

6+阅读 · 2020年3月22日

Imbalance Problems in Object Detection: A Review

Arxiv

25+阅读 · 2020年3月11日

Credibility-based Fake News Detection

Credibility-based Fake News Detection

Arxiv

3+阅读 · 2019年11月2日

Deep Learning for Deepfakes Creation and Detection

Deep Learning for Deepfakes Creation and Detection

Arxiv

6+阅读 · 2019年9月25日

Object Detection in 20 Years: A Survey

Object Detection in 20 Years: A Survey

Arxiv

48+阅读 · 2019年5月13日

A Resource-Light Method for Cross-Lingual Semantic Textual Similarity

Arxiv

3+阅读 · 2018年1月19日

Fluency-Guided Cross-Lingual Image Captioning

Arxiv

3+阅读 · 2017年8月15日

微信扫码咨询专知VIP会员