Social media platforms are vulnerable to fake news dissemination, which causes negative consequences such as panic and wrong medication in the healthcare domain. Therefore, it is important to automatically detect fake news in an early stage before they get widely spread. This paper analyzes the impact of incorporating content information, prior knowledge, and credibility of sources into models for the early detection of fake news. We propose a framework modeling those features by using BERT language model and external sources, namely Simple English Wikipedia and source reliability tags. The conducted experiments on CONSTRAINT datasets demonstrated the benefit of integrating these features for the early detection of fake news in the healthcare domain.
翻译:社交媒体平台很容易被假新闻传播,这在医疗领域造成恐慌和错误药物等负面后果,因此,在假新闻广泛传播之前,必须在早期阶段自动检测这些假新闻。本文分析将内容信息、事先知识和来源的可信度纳入早期发现假新闻模式的影响。我们建议采用BERT语言模式和外部来源,即简单英语维基百科和来源可靠性标签,来模拟这些特征。对CONSTRAINT数据集进行的实验显示了将这些功能纳入医疗领域早期发现假新闻的好处。