With short video platforms becoming one of the important channels for news sharing, major short video platforms in China have gradually become new breeding grounds for fake news. However, it is not easy to distinguish short video rumors due to the great amount of information and features contained in short videos, as well as the serious homogenization and similarity of features among videos. In order to mitigate the spread of short video rumors, our group decides to detect short video rumors by constructing multimodal feature fusion and introducing external knowledge after considering the advantages and disadvantages of each algorithm. The ideas of detection are as follows: (1) dataset creation: to build a short video dataset with multiple features; (2) multimodal rumor detection model: firstly, we use TSN (Temporal Segment Networks) video coding model to extract video features; then, we use OCR (Optical Character Recognition) and ASR (Automatic Character Recognition) to extract video features. Recognition) and ASR (Automatic Speech Recognition) fusion to extract text, and then use the BERT model to fuse text features with video features (3) Finally, use contrast learning to achieve distinction: first crawl external knowledge, then use the vector database to achieve the introduction of external knowledge and the final structure of the classification output. Our research process is always oriented to practical needs, and the related knowledge results will play an important role in many practical scenarios such as short video rumor identification and social opinion control.
翻译:随着短视频平台成为新闻分享的重要渠道之一,中国的主要短视频平台逐渐成为虚假新闻的新滋生地。然而,由于短视频中包含大量的信息和特征,以及视频之间的严重同质化和特征相似性,短视频谣言并不容易辨别。为了减轻短视频谣言的传播,我们小组考虑了每种算法的优缺点后,通过构建多模态特征融合和引入外部知识来检测短视频谣言。检测思路如下:(1)数据集创建:构建一个含有多种特征的短视频数据集;(2)多模态谣言检测模型:首先,我们使用TSN(Temporal Segment Networks)视频编码模型提取视频特征;然后,我们使用OCR(Optical Character Recognition)和ASR(Automatic Speech Recognition)提取文本特征,并使用BERT模型将文本特征与视频特征融合;(3)最终,使用对比学习实现区分:先抓取外部知识,然后使用向量数据库实现引入外部知识和分类输出的最终结构。我们的研究过程始终面向实际需求,相关的知识成果将在许多实际场景中发挥重要作用,如短视频谣言识别和社会舆论控制。