Recent advances in deep learning, and especially the invention of encoder-decoder architectures, has significantly improved the performance of abstractive summarization systems. While the majority of research has focused on written documents, we have observed an increasing interest in the summarization of dialogues and multi-party conversation over the past few years. A system that could reliably transform the audio or transcript of a human conversation into an abridged version that homes in on the most important points of the discussion would be valuable in a wide variety of real-world contexts, from business meetings to medical consultations to customer service calls. This paper focuses on abstractive summarization for multi-party meetings, providing a survey of the challenges, datasets and systems relevant to this task and a discussion of promising directions for future study.
翻译:最近深层学习的进展,特别是编码器-解码器结构的发明,大大改善了抽象总结系统的性能。虽然大多数研究侧重于书面文件,但我们注意到过去几年来对对话和多党对话的总结越来越感兴趣。一个可以可靠地将人类对话的音频或文字记录转换为简略版本的系统,在从商业会议到医疗咨询到客户服务电话等广泛的现实环境中,将最重要的讨论要点带入家庭将很有价值。本文件侧重于多党会议的抽象总结,对与这项任务有关的挑战、数据集和系统进行了调查,并讨论了今后研究的有希望的方向。