Recent efforts to create challenge benchmarks that test the abilities of natural language understanding models have largely depended on human annotations. In this work, we introduce the "Break, Perturb, Build" (BPB) framework for automatic reasoning-oriented perturbation of question-answer pairs. BPB represents a question by decomposing it into the reasoning steps that are required to answer it, symbolically perturbs the decomposition, and then generates new question-answer pairs. We demonstrate the effectiveness of BPB by creating evaluation sets for three reading comprehension (RC) benchmarks, generating thousands of high-quality examples without human intervention. We evaluate a range of RC models on our evaluation sets, which reveals large performance gaps on generated examples compared to the original data. Moreover, symbolic perturbations enable fine-grained analysis of the strengths and limitations of models. Last, augmenting the training data with examples generated by BPB helps close performance gaps, without any drop on the original data distribution.
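To make the pipeline concrete, the following is a minimal, hedged sketch (not the authors' implementation) of the BPB loop on a toy example: a question is represented by its decomposition into reasoning steps, a symbolic perturbation is applied to the decomposition (here, a hypothetical rule that flips the superlative "highest" to "lowest"), and a new question-answer pair is built from the perturbed steps. The decomposition format, the perturbation rule, and the verbalization are simplified assumptions for illustration only.

```python
# Illustrative sketch of the BPB pipeline (decompose -> perturb -> build).
# NOT the authors' implementation: the decomposition format, the perturbation
# rule, and the answer computation below are simplified assumptions.

from dataclasses import dataclass


@dataclass
class Example:
    question: str
    decomposition: list[str]  # ordered reasoning steps
    answer: str


def perturb_superlative(steps: list[str]) -> list[str]:
    """Toy symbolic perturbation: swap 'highest' for 'lowest' in each step."""
    return [s.replace("highest", "lowest") for s in steps]


def build_question(steps: list[str]) -> str:
    """Naively verbalize the perturbed decomposition back into a question."""
    top = steps[-1].removeprefix("return ")
    for i, step in enumerate(steps[:-1], start=1):
        top = top.replace(f"#{i}", step.removeprefix("return "))
    return f"What is the {top}?"


def answer_from_context(context: dict) -> str:
    """Stand-in for recomputing the answer by executing the perturbed steps."""
    return context["lowest score"]


original = Example(
    question="What is the highest score in the game?",
    decomposition=["return scores in the game", "return highest of #1"],
    answer="21",
)
context = {"highest score": "21", "lowest score": "3"}

perturbed_steps = perturb_superlative(original.decomposition)
new_example = Example(
    question=build_question(perturbed_steps),
    decomposition=perturbed_steps,
    answer=answer_from_context(context),
)
print(new_example.question, "->", new_example.answer)
```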