【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning - 专知VIP

会员服务 ·

5

自然语言处理 · 任务对话系统 · 深度学习 ·

2020 年 4 月 10 日

【ACL2020-浙大-微软】多轮对话推理数据集，MuTual: A Dataset for Multi-Turn Dialogue Reasoning

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

题目： MuTual: A Dataset for Multi-Turn Dialogue Reasoning

摘要： 近年来，非任务导向的对话系统取得了巨大的成功，这得益于大量可访问的对话数据和深度学习技术的发展。在给定的上下文中，当前的系统能够产生相关的、流畅的响应，但是由于推理能力较弱，有时会出现逻辑错误。为了便于会话推理的研究，我们引入了一个用于多回合对话推理的新数据集MuTual，包括8,860个基于中国学生英语听力考试的手动注释对话。与以前的非面向任务的对话系统的基准测试相比，MuTual测试更具挑战性，因为它需要一个能够处理各种推理问题的模型。实证结果表明，最先进的推理方法只能达到71%，远远落后于人类94%的表现，说明推理能力还有很大的提升空间。

成为VIP会员查看完整内容

37

相关内容

自然语言处理

自然语言处理

自然语言处理（NLP）是语言学，计算机科学，信息工程和人工智能的一个子领域，与计算机和人类（自然）语言之间的相互作用有关，尤其是如何对计算机进行编程以处理和分析大量自然语言数据。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【哈工大】基于文档的对话系统(DGDS)综述，A Survey of Document Grounded Dialogue Systems (DGDS)

【哈工大】基于文档的对话系统(DGDS)综述，A Survey of Document Grounded Dialogue Systems (DGDS)

专知会员服务

35+阅读 · 2020年4月30日

【论文推荐】多模态知识图谱上的端到端实体分类，End-to-End Entity Classification on Multimodal Knowledge Graphs

【论文推荐】多模态知识图谱上的端到端实体分类，End-to-End Entity Classification on Multimodal Knowledge Graphs

专知会员服务

50+阅读 · 2020年3月30日

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

专知会员服务

58+阅读 · 2020年3月13日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

52+阅读 · 2020年1月20日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【伯克利】用于文本推理的神经模块网络，Neural Module Networks for Reasoning over Text

【伯克利】用于文本推理的神经模块网络，Neural Module Networks for Reasoning over Text

专知会员服务

35+阅读 · 2019年12月10日

【AAAI2010接受论文】故事实现：将情节事件展开成句子（Story Realization: Expanding Plot Events into Sentences）

【AAAI2010接受论文】故事实现：将情节事件展开成句子（Story Realization: Expanding Plot Events into Sentences）

专知会员服务

8+阅读 · 2019年11月15日

【AAAI2020接受论文】隐式关系语言模型，CMU&微软，Latent Relation Language Models

【AAAI2020接受论文】隐式关系语言模型，CMU&微软，Latent Relation Language Models

专知会员服务

54+阅读 · 2019年11月12日

AI更懂人话：谷歌发布全新对话数据集，模仿智能助理

AI更懂人话：谷歌发布全新对话数据集，模仿智能助理

新智元

5+阅读 · 2019年9月7日

微软机器阅读理解在一场多轮对话挑战中媲美人类

微软机器阅读理解在一场多轮对话挑战中媲美人类

微软丹棱街5号

19+阅读 · 2019年5月14日

微软机器阅读理解系统性能升级，刷新CoQA对话式问答挑战赛纪录

微软机器阅读理解系统性能升级，刷新CoQA对话式问答挑战赛纪录

微软研究院AI头条

4+阅读 · 2019年5月6日

自然语言处理常识推理综述论文，60页pdf

自然语言处理常识推理综述论文，60页pdf

专知

73+阅读 · 2019年4月4日

哈工大讯飞联合实验室在机器阅读理解评测SQuAD 2.0中荣登榜首

哈工大讯飞联合实验室在机器阅读理解评测SQuAD 2.0中荣登榜首

哈工大SCIR

5+阅读 · 2018年11月22日

论文浅尝 | 在生成式多跳机器阅读任务中引入外部常识知识

论文浅尝 | 在生成式多跳机器阅读任务中引入外部常识知识

开放知识图谱

10+阅读 · 2018年10月19日

知识在检索式对话系统的应用

知识在检索式对话系统的应用

微信AI

32+阅读 · 2018年9月20日

ACL 2018 | 最佳短论文SQuAD 2.0：斯坦福大学发布的机器阅读理解问答数据集

ACL 2018 | 最佳短论文SQuAD 2.0：斯坦福大学发布的机器阅读理解问答数据集

机器之心

4+阅读 · 2018年6月13日

DuReader：百度大规模的中文机器阅读理解数据集

DuReader：百度大规模的中文机器阅读理解数据集

全球人工智能

7+阅读 · 2018年5月8日

业界 | 百度提出机器阅读理解技术V-NET，登顶MS MARCO数据集榜单

业界 | 百度提出机器阅读理解技术V-NET，登顶MS MARCO数据集榜单

机器之心

6+阅读 · 2018年2月22日

Hierarchical Human Parsing with Typed Part-Relation Reasoning

Hierarchical Human Parsing with Typed Part-Relation Reasoning

Arxiv

6+阅读 · 2020年3月10日

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

Arxiv

19+阅读 · 2020年2月15日

Neural Module Networks for Reasoning over Text

Neural Module Networks for Reasoning over Text

Arxiv

9+阅读 · 2019年12月10日

Learning a Matching Model with Co-teaching for Multi-turn Response Selection in Retrieval-based Dialogue Systems

Arxiv

6+阅读 · 2019年6月11日

Commonsense Reasoning for Natural Language Understanding: A Survey of Benchmarks, Resources, and Approaches

Arxiv

16+阅读 · 2019年4月2日

Dialogue Natural Language Inference

Arxiv

7+阅读 · 2018年11月1日

Representation Learning for Visual-Relational Knowledge Graphs

Arxiv

9+阅读 · 2018年3月31日

A dataset and architecture for visual reasoning with a working memory

Arxiv

3+阅读 · 2018年3月16日

Object-based reasoning in VQA

Arxiv

6+阅读 · 2018年1月29日

Finding ReMO (Related Memory Object): A Simple Neural Architecture for Text based Reasoning

Arxiv

4+阅读 · 2018年1月26日

VIP会员

相关主题

自然语言处理

任务对话系统

相关VIP内容

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

【SIGIR2020-中科院计算所】L2R2: 利用排名进行外展推理，L2R2: Leveraging Ranking for Abductive Reasoning

专知会员服务

11+阅读 · 2020年5月25日

【哈工大】基于文档的对话系统(DGDS)综述，A Survey of Document Grounded Dialogue Systems (DGDS)

【哈工大】基于文档的对话系统(DGDS)综述，A Survey of Document Grounded Dialogue Systems (DGDS)

专知会员服务

35+阅读 · 2020年4月30日

【论文推荐】多模态知识图谱上的端到端实体分类，End-to-End Entity Classification on Multimodal Knowledge Graphs

【论文推荐】多模态知识图谱上的端到端实体分类，End-to-End Entity Classification on Multimodal Knowledge Graphs

专知会员服务

50+阅读 · 2020年3月30日

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

【WWW2020-北京大学】多模态多轮对话系统，Multi-Modality in Multi-Turn Dialog

专知会员服务

58+阅读 · 2020年3月13日

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

【微软雷德蒙研究院】小样本自然语言生成，Few-shot Natural Language Generation for Task-Oriented Dialog

专知会员服务

33+阅读 · 2020年2月29日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

52+阅读 · 2020年1月20日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【伯克利】用于文本推理的神经模块网络，Neural Module Networks for Reasoning over Text

【伯克利】用于文本推理的神经模块网络，Neural Module Networks for Reasoning over Text

专知会员服务

35+阅读 · 2019年12月10日

【AAAI2010接受论文】故事实现：将情节事件展开成句子（Story Realization: Expanding Plot Events into Sentences）

【AAAI2010接受论文】故事实现：将情节事件展开成句子（Story Realization: Expanding Plot Events into Sentences）

专知会员服务

8+阅读 · 2019年11月15日

【AAAI2020接受论文】隐式关系语言模型，CMU&微软，Latent Relation Language Models

【AAAI2020接受论文】隐式关系语言模型，CMU&微软，Latent Relation Language Models

专知会员服务

54+阅读 · 2019年11月12日

热门VIP内容

开通专知VIP会员享更多权益服务

《使用量化测量将传感器节点关联到融合中心的算法设计》171页

军事前沿模型

提升军事训练能力的最佳人工智能模拟工具

《社交媒体信息作战》最新48页技术报告

相关资讯

AI更懂人话：谷歌发布全新对话数据集，模仿智能助理

AI更懂人话：谷歌发布全新对话数据集，模仿智能助理

新智元

5+阅读 · 2019年9月7日

微软机器阅读理解在一场多轮对话挑战中媲美人类

微软机器阅读理解在一场多轮对话挑战中媲美人类

微软丹棱街5号

19+阅读 · 2019年5月14日

微软机器阅读理解系统性能升级，刷新CoQA对话式问答挑战赛纪录

微软机器阅读理解系统性能升级，刷新CoQA对话式问答挑战赛纪录

微软研究院AI头条

4+阅读 · 2019年5月6日

自然语言处理常识推理综述论文，60页pdf

自然语言处理常识推理综述论文，60页pdf

专知

73+阅读 · 2019年4月4日

哈工大讯飞联合实验室在机器阅读理解评测SQuAD 2.0中荣登榜首

哈工大讯飞联合实验室在机器阅读理解评测SQuAD 2.0中荣登榜首

哈工大SCIR

5+阅读 · 2018年11月22日

论文浅尝 | 在生成式多跳机器阅读任务中引入外部常识知识

论文浅尝 | 在生成式多跳机器阅读任务中引入外部常识知识

开放知识图谱

10+阅读 · 2018年10月19日

知识在检索式对话系统的应用

知识在检索式对话系统的应用

微信AI

32+阅读 · 2018年9月20日

ACL 2018 | 最佳短论文SQuAD 2.0：斯坦福大学发布的机器阅读理解问答数据集

ACL 2018 | 最佳短论文SQuAD 2.0：斯坦福大学发布的机器阅读理解问答数据集

机器之心

4+阅读 · 2018年6月13日

DuReader：百度大规模的中文机器阅读理解数据集

DuReader：百度大规模的中文机器阅读理解数据集

全球人工智能

7+阅读 · 2018年5月8日

业界 | 百度提出机器阅读理解技术V-NET，登顶MS MARCO数据集榜单

业界 | 百度提出机器阅读理解技术V-NET，登顶MS MARCO数据集榜单

机器之心

6+阅读 · 2018年2月22日

相关论文

Hierarchical Human Parsing with Typed Part-Relation Reasoning

Hierarchical Human Parsing with Typed Part-Relation Reasoning

Arxiv

6+阅读 · 2020年3月10日

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation

Arxiv

19+阅读 · 2020年2月15日

Neural Module Networks for Reasoning over Text

Neural Module Networks for Reasoning over Text

Arxiv

9+阅读 · 2019年12月10日

Learning a Matching Model with Co-teaching for Multi-turn Response Selection in Retrieval-based Dialogue Systems

Arxiv

6+阅读 · 2019年6月11日

Commonsense Reasoning for Natural Language Understanding: A Survey of Benchmarks, Resources, and Approaches

Arxiv

16+阅读 · 2019年4月2日

Dialogue Natural Language Inference

Arxiv

7+阅读 · 2018年11月1日

Representation Learning for Visual-Relational Knowledge Graphs

Arxiv

9+阅读 · 2018年3月31日

A dataset and architecture for visual reasoning with a working memory

Arxiv

3+阅读 · 2018年3月16日

Object-based reasoning in VQA

Arxiv

6+阅读 · 2018年1月29日

Finding ReMO (Related Memory Object): A Simple Neural Architecture for Text based Reasoning

Arxiv

4+阅读 · 2018年1月26日

微信扫码咨询专知VIP会员