The diverse demands of different summarization tasks and their high annotation costs are driving a need for few-shot summarization. However, despite the emergence of many summarization tasks and datasets, the current training paradigm for few-shot summarization systems ignores potentially shareable knowledge in heterogeneous datasets. To this end, we propose \textsc{UniSumm}, a unified few-shot summarization model that is pre-trained on multiple summarization tasks and can be prefix-tuned to excel at any few-shot summarization dataset. Meanwhile, to better evaluate few-shot summarization systems, following the principles of diversity and robustness, we assemble and release a new benchmark, \textsc{SummZoo}. It consists of $8$ diverse summarization tasks, with multiple sets of few-shot samples for each task, covering both monologue and dialogue domains. Experimental results and ablation studies show that \textsc{UniSumm} outperforms strong baseline systems by a large margin across all tasks in \textsc{SummZoo} under both automatic and human evaluation. We release our code and benchmark at \url{https://github.com/microsoft/UniSumm}.