With short video platforms becoming one of the important channels for news sharing, major short video platforms in China have gradually become new breeding grounds for fake news. However, it is not easy to distinguish short video rumors due to the great amount of information and features contained in short videos, as well as the serious homogenization and similarity of features among videos. In order to mitigate the spread of short video rumors, our group decides to detect short video rumors by constructing multimodal feature fusion and introducing external knowledge after considering the advantages and disadvantages of each algorithm. The ideas of detection are as follows: (1) dataset creation: to build a short video dataset with multiple features; (2) multimodal rumor detection model: firstly, we use TSN (Temporal Segment Networks) video coding model to extract video features; then, we use OCR (Optical Character Recognition) and ASR (Automatic Character Recognition) to extract video features. Recognition) and ASR (Automatic Speech Recognition) fusion to extract text, and then use the BERT model to fuse text features with video features (3) Finally, use contrast learning to achieve distinction: first crawl external knowledge, then use the vector database to achieve the introduction of external knowledge and the final structure of the classification output. Our research process is always oriented to practical needs, and the related knowledge results will play an important role in many practical scenarios such as short video rumor identification and social opinion control.
翻译:随着短视频平台成为新闻共享的重要渠道之一,中国主要短视频平台逐渐成为虚假新闻的新温床。然而,由于短视频中包含了大量的信息和特征,以及特征之间的严重同质化和相似性,因此很难区分短视频谣言。为了缓解短视频谣言的传播,我们的团队考虑到每个算法的优点和缺点,构建多模态特征融合并引入外部知识来进行短视频谣言检测。检测的思路如下:(1)数据集的创建:构建包含多种特征的短视频数据集;(2)多模态谣言检测模型:首先,我们使用时域段网络(TSN)视频编码模型提取视频特征;接着,我们采用OCR(光学字符识别)和ASR(自动语音识别)来提取视频特征和文本特征,并使用BERT模型将文本特征与视频特征融合;(3)最后,采用对比学习实现区分:首先抓取外部知识,随后使用向量数据库实现对外部知识的引入,最终结构为分类输出。我们的研究过程始终是面向实践需求的,相关知识研究结果必将在诸多实践场景,如短视频谣言鉴别、社会舆情调控等方面起到重要的作用。