The significance of social media has increased manifold in the past few decades as it helps people from even the most remote corners of the world stay connected. With the COVID-19 pandemic raging, social media has become more relevant and widely used than ever before, and along with this, there has been a resurgence in the circulation of fake news and tweets that demand immediate attention. In this paper, we describe our Fake News Detection system that automatically identifies whether a tweet related to COVID-19 is "real" or "fake", as a part of CONSTRAINT COVID19 Fake News Detection in English challenge. We have used an ensemble model consisting of pre-trained models that has helped us achieve a joint 8th position on the leader board. We have achieved an F1-score of 0.9831 against a top score of 0.9869. Post completion of the competition, we have been able to drastically improve our system by incorporating a novel heuristic algorithm based on username handles and link domains in tweets fetching an F1-score of 0.9883 and achieving state-of-the art results on the given dataset.
翻译:在过去几十年中,社交媒体的重要性增加了,因为它帮助了世界最偏远角落的人们保持联系。随着COVID-19大流行的肆虐,社交媒体变得比以往更加相关和广泛使用,与此同时,虚假新闻和需要立即关注的推特的发行量也重新出现。在本文中,我们描述了我们的假新闻探测系统,该系统自动确定与COVID-19有关的推特是“真实”还是“假”,作为SONTRAINT COVID19 Fake新闻在英语挑战中的探测的一部分。我们使用了由预先培训的模式组成的混合模型,帮助我们在领导委员会中取得联合第八位职位。我们已经实现了0.9831的F1,而最高分为0.9869。 竞争结束后,我们通过在获取F1分的0.9883分和在给定数据集上取得最新艺术成果,通过在用户名控控点和链接域中纳入新的超音法来大幅度改进了我们的系统。