Modern technology has made it convenient for people to share their opinions across an abundance of platforms. These platforms let users express themselves in multiple modalities, including text, images, video, and audio. This abundance of multi-modal content, however, makes it difficult for users to obtain all the key information about a topic, which makes automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of existing research on MMS.