D2S: 文档到滑坡生成器Via 查询式文本摘要 (D2S: Document-to-Slide Generation Via Query-Based Text Summarization)

Presentations are critical for communication in all areas of our lives, yet the creation of slide decks is often tedious and time-consuming. There has been limited research aiming to automate the document-to-slides generation process and all face a critical challenge: no publicly available dataset for training and benchmarking. In this work, we first contribute a new dataset, SciDuet, consisting of pairs of papers and their corresponding slides decks from recent years' NLP and ML conferences (e.g., ACL). Secondly, we present D2S, a novel system that tackles the document-to-slides task with a two-step approach: 1) Use slide titles to retrieve relevant and engaging text, figures, and tables; 2) Summarize the retrieved context into bullet points with long-form question answering. Our evaluation suggests that long-form QA outperforms state-of-the-art summarization baselines on both automated ROUGE metrics and qualitative human evaluation.

翻译：演示对于我们生活的各个领域的交流至关重要,然而,创建幻灯片甲板往往既乏味又费时,研究范围有限,旨在将文件到幻灯片的生成过程自动化,而且都面临严峻的挑战:没有可供公众查阅的用于培训和基准衡量的数据集。在这项工作中,我们首先提供一个新的数据集,SciDuet,由近年国家实验室和多边实验室会议(如ACL)的双对纸张及其相应的幻灯片甲板组成。第二,我们提出D2S,这是一个处理文件到幻灯片任务的新系统,采用两步方法:1)使用幻灯片标题检索相关和有吸引力的文本、图表和表格;2)将检索到的上下文汇总成长式问题的回答圆点。我们的评价表明,长式QA在自动的ROUGE测量和定性的人评价上都超越了最先进的总和基线。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【论文推荐】文本摘要简述

专知会员服务

69+阅读 · 2020年7月20日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

33+阅读 · 2020年4月24日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日