Proposal of large-scale datasets has facilitated research on deep neural models for news summarization. Deep learning can also be potentially useful for spoken dialogue summarization, which can benefit a range of real-life scenarios including customer service management and medication tracking. To this end, we propose DialogSum, a large-scale labeled dialogue summarization dataset. We conduct empirical analysis on DialogSum using state-of-the-art neural summarizers. Experimental results show unique challenges in dialogue summarization, such as spoken terms, special discourse structures, coreferences and ellipsis, pragmatics and social commonsense, which require specific representation learning technologies to better deal with.
翻译:深度学习对口语对话总结也可能有用,这有利于一系列现实生活情景,包括客户服务管理和药物跟踪。为此,我们提议 " DialogSum ",这是一个大规模标记的对话总结数据集。我们使用最新神经总结器对 " DialogSum " 进行经验分析。实验结果显示,对话总结存在独特的挑战,如口语、特别谈话结构、共同参照和省略、实用和社会常识,这需要具体的代表性学习技术才能更好地处理。