Sentence summarization aims at compressing a long sentence into a short one that keeps the main gist, and has extensive real-world applications such as headline generation. In previous work, researchers have developed various approaches to improve the ROUGE score, which is the main evaluation metric for summarization, whereas controlling the summary length has not drawn much attention. In our work, we address a new problem of explicit character-level length control for summarization, and propose a dynamic programming algorithm based on the Connectionist Temporal Classification (CTC) model. Results show that our approach not only achieves higher ROUGE scores but also yields more complete sentences.
翻译:句子总和旨在将长句压缩成一个短句,保持主语不变,并具有广泛的真实世界应用,如标题一代。在以往的工作中,研究人员制定了各种方法来改进ROUGE评分,这是总结的主要评价尺度,而控制摘要长度没有引起多少注意。在我们的工作中,我们处理一个明确性格水平的总结长度控制的新问题,并提议一个基于“连接时间分类(CTC)”模式的动态编程算法。 结果显示,我们的方法不仅能达到更高的ROUGE评分,而且能产生更完整的句子。