你有多了解你的汇总数据集? (How well do you know your summarization datasets?) - 专知论文

会员服务 ·

0

Performer · state-of-the-art · MoDELS · 噪声 · ROUGE ·

2021 年 6 月 21 日

How well do you know your summarization datasets?

翻译：你有多了解你的汇总数据集?

Priyam Tejaswin,Dhruv Naik,Pengfei Liu

from arxiv, Accepted into Findings of ACL-IJCNLP 2021

State-of-the-art summarization systems are trained and evaluated on massive datasets scraped from the web. Despite their prevalence, we know very little about the underlying characteristics (data noise, summarization complexity, etc.) of these datasets, and how these affect system performance and the reliability of automatic metrics like ROUGE. In this study, we manually analyze 600 samples from three popular summarization datasets. Our study is driven by a six-class typology which captures different noise types (missing facts, entities) and degrees of summarization difficulty (extractive, abstractive). We follow with a thorough analysis of 27 state-of-the-art summarization models and 5 popular metrics, and report our key insights: (1) Datasets have distinct data quality and complexity distributions, which can be traced back to their collection process. (2) The performance of models and reliability of metrics is dependent on sample complexity. (3) Faithful summaries often receive low scores because of the poor diversity of references. We release the code, annotated data and model outputs.

翻译：尽管这些数据集普遍存在,但我们对这些数据集的基本特征(数据噪音、汇总复杂性等)知之甚少,这些特征如何影响系统性能和像ROUGE这样的自动计量的可靠性。在这项研究中,我们手动分析了三个广受欢迎的汇总数据集的600个样本。我们的研究是由六类类型类型驱动的,它捕捉了不同的噪音类型(缺乏事实、实体)和汇总难度(极端、抽象)的程度。我们对这些数据集的基本特征(数据噪音、汇总复杂性等)进行透彻分析,并报告我们的主要见解:(1)数据集具有不同的数据质量和复杂性分布,可追溯到它们的收集过程。(2) 模型的性能和可靠性取决于抽样的复杂性。(3) 可靠的摘要往往由于引用的多样性而得分较低。我们发布了代码、附加说明的数据和模型产出。

0

相关内容

Performer

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

113+阅读 · 2021年4月17日

白皮书 | 《工业互联网数据安全白皮书（2020）》（附下载）

白皮书 | 《工业互联网数据安全白皮书（2020）》（附下载）

专知会员服务

65+阅读 · 2021年1月17日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

《Hands-On Machine Learning with Scikit-Learn and TensorFlow》Scikit-Learn与TensorFlow机器学习实用指南

《Hands-On Machine Learning with Scikit-Learn and TensorFlow》Scikit-Learn与TensorFlow机器学习实用指南

专知会员服务

65+阅读 · 2019年10月27日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

已删除

将门创投

4+阅读 · 2018年11月15日

Czech News Dataset for Semantic Textual Similarity

Arxiv

0+阅读 · 2021年8月23日

Term Interrelations and Trends in Software Engineering

Arxiv

0+阅读 · 2021年8月21日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Arxiv

6+阅读 · 2020年3月1日

The Knowledge Within: Methods for Data-Free Model Compression

The Knowledge Within: Methods for Data-Free Model Compression

Arxiv

3+阅读 · 2019年12月3日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis

Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis

Arxiv

4+阅读 · 2019年3月27日

Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph

Arxiv

6+阅读 · 2018年10月4日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

Learning to Extract Coherent Summary via Deep Reinforcement Learning

Arxiv

6+阅读 · 2018年4月19日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

113+阅读 · 2021年4月17日

白皮书 | 《工业互联网数据安全白皮书（2020）》（附下载）

白皮书 | 《工业互联网数据安全白皮书（2020）》（附下载）

专知会员服务

65+阅读 · 2021年1月17日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

《Hands-On Machine Learning with Scikit-Learn and TensorFlow》Scikit-Learn与TensorFlow机器学习实用指南

《Hands-On Machine Learning with Scikit-Learn and TensorFlow》Scikit-Learn与TensorFlow机器学习实用指南

专知会员服务

65+阅读 · 2019年10月27日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACMMM2025教程】打击网络虚假信息视频：特征分析、检测与防范，170页ppt

海军无人系统：海上作战的演进而非革命

Nature 子刊 | SciToolAgent:知识图谱引导的科学工具智能体

多媒体顶会ACM Multimedia 2025各大奖项揭晓！格拉斯哥大学等获最佳论文，中科院自动化所等获最佳学生论文

相关资讯

已删除

将门创投

4+阅读 · 2018年11月15日

相关论文

Czech News Dataset for Semantic Textual Similarity

Arxiv

0+阅读 · 2021年8月23日

Term Interrelations and Trends in Software Engineering

Arxiv

0+阅读 · 2021年8月21日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Arxiv

6+阅读 · 2020年3月1日

The Knowledge Within: Methods for Data-Free Model Compression

The Knowledge Within: Methods for Data-Free Model Compression

Arxiv

3+阅读 · 2019年12月3日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis

Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis

Arxiv

4+阅读 · 2019年3月27日

Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph

Arxiv

6+阅读 · 2018年10月4日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

Learning to Extract Coherent Summary via Deep Reinforcement Learning

Arxiv

6+阅读 · 2018年4月19日

微信扫码咨询专知VIP会员