以人文反馈对书籍进行递递递性摘要 (Recursively Summarizing Books with Human Feedback)

A major challenge for scaling machine learning is training models to perform tasks that are very difficult or time-consuming for humans to evaluate. We present progress on this problem on the task of abstractive summarization of entire fiction novels. Our method combines learning from human feedback with recursive task decomposition: we use models trained on smaller parts of the task to assist humans in giving feedback on the broader task. We collect a large volume of demonstrations and comparisons from human labelers, and fine-tune GPT-3 using behavioral cloning and reward modeling to do summarization recursively. At inference time, the model first summarizes small sections of the book and then recursively summarizes these summaries to produce a summary of the entire book. Our human labelers are able to supervise and evaluate the models quickly, despite not having read the entire books themselves. Our resulting model generates sensible summaries of entire books, even matching the quality of human-written summaries in a few cases ($\sim5\%$ of books). We achieve state-of-the-art results on the recent BookSum dataset for book-length summarization. A zero-shot question-answering model using these summaries achieves state-of-the-art results on the challenging NarrativeQA benchmark for answering questions about books and movie scripts. We release datasets of samples from our model.

翻译：扩大机器学习的重大挑战是培训模型,以完成非常困难或耗费时间的任务,供人类评估。我们介绍这个问题在抽象总结整个小说小说的任务上的进展。我们的方法将人类反馈的学习与递归任务分解结合起来:我们使用在任务较小部分上受过训练的模型,协助人类就更广泛的任务提供反馈。我们收集了大量的演示和人类标签师的比较,以及使用行为克隆和奖励模型进行总结的精细GPT-3, 反复进行。在推断时,模型首先总结书的小部分,然后循环总结这些摘要,以产生整个书的摘要。我们的人类标签员能够快速监督和评价模型,尽管没有阅读全部书籍。我们产生的模型生成了整个书籍的合理摘要,甚至与少数情况下的人写摘要的质量相匹配(模型$sim5 ⁇ 美元)。我们从最近的书本数据集中取得了最新的艺术成果,用于图书摘要的总结,然后循环总结这些摘要。我们人类标签能够快速地监督和评价模型,我们用这些具有挑战性的版本的样本,我们用这些模型来得出了这些具有挑战性的问题。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

33+阅读 · 2020年4月24日

【Google】无监督机器翻译，Unsupervised Machine Translation

专知会员服务

36+阅读 · 2020年3月3日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【反馈循环自编码器】FEEDBACK RECURRENT AUTOENCODER

专知会员服务

23+阅读 · 2020年1月28日