Video communication has been rapidly increasing over the past decade, with YouTube providing a medium where users can post, discover, share, and react to videos. There has also been an increase in the number of videos citing research articles, especially since it has become relatively commonplace for academic conferences to require video submissions. However, the relationship between research articles and YouTube videos is not clear, and the purpose of the present paper is to address this issue. We created new datasets using YouTube videos and mentions of research articles on various online platforms. We found that most of the articles cited in the videos are related to medicine and biochemistry. We analyzed these datasets through statistical techniques and visualization, and built machine learning models to predict (1) whether a research article is cited in videos, (2) whether a research article cited in a video achieves a level of popularity, and (3) whether a video citing a research article becomes popular. The best models achieved F1 scores between 80% and 94%. According to our results, research articles mentioned in more tweets and news coverage have a higher chance of receiving video citations. We also found that video views are important for predicting citations and increasing research articles' popularity and public engagement with science.
翻译:过去十年来,视频通信迅速增加,YouTube为用户提供了一个媒体,可以张贴、发现、分享和对视频作出反应。还增加了引用研究文章的视频数量,特别是由于这已成为学术会议要求提交视频的相对常见的地方。然而,研究文章与YouTube视频之间的关系并不明确,本文的目的是解决这一问题。我们利用YouTube视频创建了新的数据集,并在各种在线平台上提及研究文章。我们发现视频中引用的大部分文章与医学和生物化学有关。我们通过统计技术和可视化分析了这些数据集,并建立了机器学习模型,以预测:(1) 视频中是否引用了研究文章,(2) 视频中引用的研究文章是否达到受欢迎程度,(3) 引用研究文章的视频是否受到欢迎,(3) 最佳模型达到F1评分80%至94%。根据我们的结果,更多的推文和新闻报道中提到的研究文章获得视频引用的机会更大。我们还发现,视频观点对于预测引用和增加研究文章的受欢迎程度以及公众与科学的接触很重要。