Abstractive dialogue summarization is to generate a concise and fluent summary covering the salient information in a dialogue among two or more interlocutors. It has attracted great attention in recent years based on the massive emergence of social communication platforms and an urgent requirement for efficient dialogue information understanding and digestion. Different from news or articles in traditional document summarization, dialogues bring unique characteristics and additional challenges, including different language styles and formats, scattered information, flexible discourse structures and unclear topic boundaries. This survey provides a comprehensive investigation on existing work for abstractive dialogue summarization from scenarios, approaches to evaluations. It categorizes the task into two broad categories according to the type of input dialogues, i.e., open-domain and task-oriented, and presents a taxonomy of existing techniques in three directions, namely, injecting dialogue features, designing auxiliary training tasks and using additional data.A list of datasets under different scenarios and widely-accepted evaluation metrics are summarized for completeness. After that, the trends of scenarios and techniques are summarized, together with deep insights on correlations between extensively exploited features and different scenarios. Based on these analyses, we recommend future directions including more controlled and complicated scenarios, technical innovations and comparisons, publicly available datasets in special domains, etc.
翻译:由于社会交流平台的大规模出现以及有效对话信息理解和消化的迫切要求,近年来,由于社会交流平台的大规模出现和对有效对话信息的理解和消化的迫切需要,它引起了极大的注意。与传统文件总结中的新闻或文章不同,对话带来了独特的特点和额外的挑战,包括不同的语言风格和格式、分散的信息、灵活的讨论结构和不明确的专题界限。这项调查对目前关于抽象对话总结设想方案、评价方法等突出信息的工作进行了全面调查。它根据投入对话的类型将任务分为两大类,即开放和面向任务,并提出了三种方向现有技术的分类,即注射对话特征、设计辅助培训任务和使用额外数据。根据不同设想方案和格式、分散的信息、灵活的讨论结构和广泛接受的评价指标,对各种设想方案和技术的趋势进行了总结,并对广泛利用的特点和不同设想方案之间的相互关系进行了深入的深入了解。我们根据这些分析,建议了未来方向,包括更受控制和复杂的领域、技术创新以及公开比较等。