Online encyclopedias, such as Wikipedia, have been well-developed and researched in the last two decades. One can find any attributes or other information of a wiki item on a wiki page edited by a community of volunteers. However, the traditional text, images and tables can hardly express some aspects of an wiki item. For example, when we talk about ``Shiba Inu'', one may care more about ``How to feed it'' or ``How to train it not to protect its food''. Currently, short-video platforms have become a hallmark in the online world. Whether you're on TikTok, Instagram, Kuaishou, or YouTube Shorts, short-video apps have changed how we consume and create content today. Except for producing short videos for entertainment, we can find more and more authors sharing insightful knowledge widely across all walks of life. These short videos, which we call knowledge videos, can easily express any aspects (e.g. hair or how-to-feed) consumers want to know about an item (e.g. Shiba Inu), and they can be systematically analyzed and organized like an online encyclopedia. In this paper, we propose Kuaipedia, a large-scale multi-modal encyclopedia consisting of items, aspects, and short videos lined to them, which was extracted from billions of videos of Kuaishou (Kwai), a well-known short-video platform in China. We first collected items from multiple sources and mined user-centered aspects from millions of users' queries to build an item-aspect tree. Then we propose a new task called ``multi-modal item-aspect linking'' as an expansion of ``entity linking'' to link short videos into item-aspect pairs and build the whole short-video encyclopedia. Intrinsic evaluations show that our encyclopedia is of large scale and highly accurate. We also conduct sufficient extrinsic experiments to show how Kuaipedia can help fundamental applications such as entity typing and entity linking.
翻译:在线百科全书,例如维基百科,在过去二十年中已经得到了完善和研究。 人们可以在由志愿者团体编辑的维基页面上找到任何维基内容的属性或其他信息。 然而, 传统文本、 图像和表格很难表达维基项目的某些方面。 例如, 当我们谈论“ Shiba Inu' ” 时, 人们可能更关心“ 如何喂养它 ” 或“ 如何训练它来保护食物 ” 。 目前, 短视平台已成为在线世界的一个标志。 无论你在TikTok、 Instagram、 Kuaïhou 或YouTube Shorts 上, 短视频应用程序改变了我们今天的消费和创建内容。 除了制作简短的娱乐视频, 我们能找到更多在生活各行各行之间分享深刻知识的作者。 这些简短的视频, 我们称之为知识视频, 能够很容易表达任何短方面( 例如, 头发或多功能) 。 消费者想要从一个短路端的用户信息, 将一个短路端的视频数据转换成一个网络, 并组织一个大型的游戏的条目, 将一个网络化成一个网络。