MECPAD: 中国多领域预言-论证数据集 (MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset) - 专知论文

会员服务 ·

0

讲稿 · 数据集 · Performer · Neural Networks · Processing（编程语言） ·

2022 年 5 月 13 日

MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset

翻译：MECPAD: 中国多领域预言-论证数据集

Yahui Liu,Haoping Yang,Chen Gong,Qingrong Xia,Zhenghua Li,Min Zhang

from arxiv, Accepted by NAACL2022 (Main conference)

During the past decade, neural network models have made tremendous progress on in-domain semantic role labeling (SRL). However, performance drops dramatically under the out-of-domain setting. In order to facilitate research on cross-domain SRL, this paper presents MuCPAD, a multi-domain Chinese predicate-argument dataset, which consists of 30,897 sentences and 92,051 predicates from six different domains. MuCPAD exhibits three important features. 1) Based on a frame-free annotation methodology, we avoid writing complex frames for new predicates. 2) We explicitly annotate omitted core arguments to recover more complete semantic structure, considering that omission of content words is ubiquitous in multi-domain Chinese texts. 3) We compile 53 pages of annotation guidelines and adopt strict double annotation for improving data quality. This paper describes in detail the annotation methodology and annotation process of MuCPAD, and presents in-depth data analysis. We also give benchmark results on cross-domain SRL based on MuCPAD.

翻译：在过去十年中,神经网络模型在内部语义作用标签方面取得了巨大进展。然而,在外域设置下,性能显著下降。为了便利对跨域SRL进行研究,本文件介绍了中华多面的中国上游参数数据集MuCPAD, 这是一个由30,897个判决和来自六个不同领域的92,051个上游组成的多面中国上游参数数据集。中巴发委会展示了三个重要特征。 1)根据无框架说明方法,我们避免为新上游绘制复杂的框架。 2)我们明确说明遗漏的核心参数,以恢复更完整的语义结构,考虑到在多面中文文本中遗漏内容词是无处不在的。3)我们汇编了53页注解指南,并采用了严格的双重注解,以提高数据质量。本文详细介绍了《中巴发委》的注解方法和注过程,并介绍了深入的数据分析。我们还根据《中华发》对跨面SRL提供了基准结果。

0

相关内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

PTHrP NLS与C末端促进小鼠脊髓损伤后髓鞘再生的作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

抑制LINGO-1促进移植人少突胶质前体细胞分化和髓鞘化的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

石菖蒲抗阿尔茨海默病(AD)的药效物质基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

TfR抗体和CTX修饰纳米载体介导hTERTC27治疗神经胶质瘤

国家自然科学基金

0+阅读 · 2009年12月31日

Domain Adaptive Nuclei Instance Segmentation and Classification via Category-aware Feature Alignment and Pseudo-labelling

Arxiv

0+阅读 · 2022年7月4日

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

Arxiv

0+阅读 · 2022年7月4日

LifeLonger: A Benchmark for Continual Disease Classification

Arxiv

0+阅读 · 2022年6月30日

"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer

Arxiv

0+阅读 · 2022年6月30日

Chinese NER Using Lattice LSTM

Arxiv

14+阅读 · 2018年5月15日

VIP会员

文章信息

相关主题

Neural Networks

Processing（编程语言）

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Domain Adaptive Nuclei Instance Segmentation and Classification via Category-aware Feature Alignment and Pseudo-labelling

Arxiv

0+阅读 · 2022年7月4日

Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation

Arxiv

0+阅读 · 2022年7月4日

LifeLonger: A Benchmark for Continual Disease Classification

Arxiv

0+阅读 · 2022年6月30日

"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer

Arxiv

0+阅读 · 2022年6月30日

Chinese NER Using Lattice LSTM

Arxiv

14+阅读 · 2018年5月15日

相关基金

PTHrP NLS与C末端促进小鼠脊髓损伤后髓鞘再生的作用及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

抑制LINGO-1促进移植人少突胶质前体细胞分化和髓鞘化的实验研究

国家自然科学基金

0+阅读 · 2014年12月31日

从调控星形胶质细胞活化异质性探讨益肾化浊通络法对多发性硬化髓鞘再生适应性保护效应机制

国家自然科学基金

0+阅读 · 2013年12月31日

石菖蒲抗阿尔茨海默病(AD)的药效物质基础研究

国家自然科学基金

0+阅读 · 2011年12月31日

TfR抗体和CTX修饰纳米载体介导hTERTC27治疗神经胶质瘤

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员