【SIGMOD2020】一个全面的主动学习方法的实体匹配基准框架，A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching - 专知VIP

会员服务 ·

1

实体匹配 · 自然语言处理 · 主动学习 · 人工智能 ·

2020 年 3 月 31 日

【SIGMOD2020】一个全面的主动学习方法的实体匹配基准框架，A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

标题

一个全面的主动学习方法的实体匹配基准框架，A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching

关键字

实体匹配，自然语言处理，主动学习，人工智能

简介

实体匹配（EM）是一项核心的数据清理任务，旨在识别同一真实世界实体的不同提及。主动学习是在实践中解决稀缺标签数据挑战的一种方法，方法是动态收集要由Oracle标记的必要示例并在其上完善学习的模型（分类器）。在本文中，我们为EM建立了统一的主动学习基准框架，使用户可以轻松地将不同的学习算法与适用的示例选择算法结合起来。该框架的目标是为从业人员制定具体的指导方针，以说明哪些主动学习组合将对EM有效。为此，我们使用包括EM质量，＃labels和示例选择等待时间在内的各种指标，对来自产品和出版领域的公开可用EM数据集进行了全面的实验，以评估主动学习方法。我们最令人惊讶的结果发现，标签较少的主动学习可以学习质量与监督学习相当的分类器。实际上，对于其中的一些数据集，我们表明有一种主动的学习组合可以击败最新的监督学习结果。我们的框架还包括新颖的优化功能，这些功能可将学习模型的F1分数提高大约9％，并将示例选择延迟降低10倍，而不会影响模型的质量。

作者

Vamsi Meduri，Lucian Popa，Prithviraj Sen，Mohamed Sarwat，来自Arizona State University，IBM Research, Almaden

成为VIP会员查看完整内容

24

相关内容

实体匹配

【IJCAJ 2019】多视角知识图谱嵌入的实体对齐，Multi-view Knowledge Graph Embedding for Entity Alignment

【IJCAJ 2019】多视角知识图谱嵌入的实体对齐，Multi-view Knowledge Graph Embedding for Entity Alignment

专知会员服务

59+阅读 · 2020年6月30日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

【Facebook AI】自监督学习在计算机视觉应用最新概述，108页ppt Self-supervised learning

【Facebook AI】自监督学习在计算机视觉应用最新概述，108页ppt Self-supervised learning

专知会员服务

164+阅读 · 2020年4月19日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

深度强化学习方法及其在经济学中的应用综述，Comprehensive Review of Deep Reinforcement Learning Methods and Applicationsin Economic

深度强化学习方法及其在经济学中的应用综述，Comprehensive Review of Deep Reinforcement Learning Methods and Applicationsin Economic

专知会员服务

52+阅读 · 2020年4月7日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

【WSDN 2020 论文】一种结构图表示学习框架（A Structural Graph Representation Learning Framework）

【WSDN 2020 论文】一种结构图表示学习框架（A Structural Graph Representation Learning Framework）

专知会员服务

74+阅读 · 2019年11月20日

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

专知会员服务

99+阅读 · 2019年11月11日

【ICML2019 tutorial】主动学习:从理论到实践（Active Learning: From Theory to Practice），Robert Nowak，Steve Hanneke

【ICML2019 tutorial】主动学习:从理论到实践（Active Learning: From Theory to Practice），Robert Nowak，Steve Hanneke

专知会员服务

48+阅读 · 2019年6月10日

NLP+CV《桥接视觉与语言的研究综述》，带你全面了解视觉+语言最新应用和方法

NLP+CV《桥接视觉与语言的研究综述》，带你全面了解视觉+语言最新应用和方法

中国人工智能学会

27+阅读 · 2019年7月24日

ACL 2019开源论文 | 句对匹配任务中的样本选择偏差与去偏方法

ACL 2019开源论文 | 句对匹配任务中的样本选择偏差与去偏方法

PaperWeekly

6+阅读 · 2019年7月12日

命名实体识别（NER）综述

命名实体识别（NER）综述

AI研习社

66+阅读 · 2019年1月30日

南洋理工最新《命名实体识别深度学习方法》综述论文，25页pdf

南洋理工最新《命名实体识别深度学习方法》综述论文，25页pdf

专知

46+阅读 · 2018年12月28日

资源 | 一份非常全面的开源数据集

资源 | 一份非常全面的开源数据集

黑龙江大学自然语言处理实验室

10+阅读 · 2018年9月7日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

LibRec 精选：推荐系统9个必备数据集

LibRec 精选：推荐系统9个必备数据集

LibRec智能推荐

6+阅读 · 2018年3月7日

Machine Learning：十大机器学习算法

Machine Learning：十大机器学习算法

开源中国

21+阅读 · 2018年3月1日

论文浅尝 | Hike: A Hybrid Human-Machine Method for Entity Alignment

论文浅尝 | Hike: A Hybrid Human-Machine Method for Entity Alignment

开放知识图谱

4+阅读 · 2017年12月30日

深度文本匹配开源工具（MatchZoo）

深度文本匹配开源工具（MatchZoo）

中国科学院网络数据重点实验室

7+阅读 · 2017年12月5日

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

A Comprehensive Survey on Transfer Learning

A Comprehensive Survey on Transfer Learning

Arxiv

121+阅读 · 2019年11月7日

Dynamic Transfer Learning for Named Entity Recognition

Dynamic Transfer Learning for Named Entity Recognition

Arxiv

5+阅读 · 2019年5月1日

Multi-Instance Learning for End-to-End Knowledge Base Question Answering

Multi-Instance Learning for End-to-End Knowledge Base Question Answering

Arxiv

4+阅读 · 2019年3月6日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

73+阅读 · 2018年12月22日

A Survey of Learning Causality with Data: Problems and Methods

A Survey of Learning Causality with Data: Problems and Methods

Arxiv

19+阅读 · 2018年9月25日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

DeepFM: An End-to-End Wide & Deep Learning Framework for CTR Prediction

Arxiv

6+阅读 · 2018年4月12日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

MatchZoo: A Toolkit for Deep Text Matching

Arxiv

5+阅读 · 2017年7月23日

VIP会员

相关主题

自然语言处理

相关VIP内容

【IJCAJ 2019】多视角知识图谱嵌入的实体对齐，Multi-view Knowledge Graph Embedding for Entity Alignment

【IJCAJ 2019】多视角知识图谱嵌入的实体对齐，Multi-view Knowledge Graph Embedding for Entity Alignment

专知会员服务

59+阅读 · 2020年6月30日

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

【论文推荐】层次知识图谱，Hierarchical Knowledge Graphs: A Novel Information Representation for Exploratory Search Tasks

专知会员服务

49+阅读 · 2020年5月26日

【Facebook AI】自监督学习在计算机视觉应用最新概述，108页ppt Self-supervised learning

【Facebook AI】自监督学习在计算机视觉应用最新概述，108页ppt Self-supervised learning

专知会员服务

164+阅读 · 2020年4月19日

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

【CVPR2020-国科大】状态标签对抗主动学习，Adversarial Active Learning

专知会员服务

48+阅读 · 2020年4月13日

深度强化学习方法及其在经济学中的应用综述，Comprehensive Review of Deep Reinforcement Learning Methods and Applicationsin Economic

深度强化学习方法及其在经济学中的应用综述，Comprehensive Review of Deep Reinforcement Learning Methods and Applicationsin Economic

专知会员服务

52+阅读 · 2020年4月7日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

【强化学习资源集合】Awesome Reinforcement Learning

【强化学习资源集合】Awesome Reinforcement Learning

专知会员服务

97+阅读 · 2019年12月23日

【WSDN 2020 论文】一种结构图表示学习框架（A Structural Graph Representation Learning Framework）

【WSDN 2020 论文】一种结构图表示学习框架（A Structural Graph Representation Learning Framework）

专知会员服务

74+阅读 · 2019年11月20日

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

专知会员服务

99+阅读 · 2019年11月11日

【ICML2019 tutorial】主动学习:从理论到实践（Active Learning: From Theory to Practice），Robert Nowak，Steve Hanneke

【ICML2019 tutorial】主动学习:从理论到实践（Active Learning: From Theory to Practice），Robert Nowak，Steve Hanneke

专知会员服务

48+阅读 · 2019年6月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《美陆军徒步机动作战条令手册》最新168页

【博士论文】基于不确定性的可靠性：现代机器学习中的选择性预测与可信部署

军事后勤数字化未来展望

《美海军后勤体系整合与创新挑战》最新报告

相关资讯

NLP+CV《桥接视觉与语言的研究综述》，带你全面了解视觉+语言最新应用和方法

NLP+CV《桥接视觉与语言的研究综述》，带你全面了解视觉+语言最新应用和方法

中国人工智能学会

27+阅读 · 2019年7月24日

ACL 2019开源论文 | 句对匹配任务中的样本选择偏差与去偏方法

ACL 2019开源论文 | 句对匹配任务中的样本选择偏差与去偏方法

PaperWeekly

6+阅读 · 2019年7月12日

命名实体识别（NER）综述

命名实体识别（NER）综述

AI研习社

66+阅读 · 2019年1月30日

南洋理工最新《命名实体识别深度学习方法》综述论文，25页pdf

南洋理工最新《命名实体识别深度学习方法》综述论文，25页pdf

专知

46+阅读 · 2018年12月28日

资源 | 一份非常全面的开源数据集

资源 | 一份非常全面的开源数据集

黑龙江大学自然语言处理实验室

10+阅读 · 2018年9月7日

笔记 | Deep active learning for named entity recognition

笔记 | Deep active learning for named entity recognition

黑龙江大学自然语言处理实验室

24+阅读 · 2018年5月27日

LibRec 精选：推荐系统9个必备数据集

LibRec 精选：推荐系统9个必备数据集

LibRec智能推荐

6+阅读 · 2018年3月7日

Machine Learning：十大机器学习算法

Machine Learning：十大机器学习算法

开源中国

21+阅读 · 2018年3月1日

论文浅尝 | Hike: A Hybrid Human-Machine Method for Entity Alignment

论文浅尝 | Hike: A Hybrid Human-Machine Method for Entity Alignment

开放知识图谱

4+阅读 · 2017年12月30日

深度文本匹配开源工具（MatchZoo）

深度文本匹配开源工具（MatchZoo）

中国科学院网络数据重点实验室

7+阅读 · 2017年12月5日

相关论文

A Simple Framework for Contrastive Learning of Visual Representations

Arxiv

21+阅读 · 2020年2月13日

A Comprehensive Survey on Transfer Learning

A Comprehensive Survey on Transfer Learning

Arxiv

121+阅读 · 2019年11月7日

Dynamic Transfer Learning for Named Entity Recognition

Dynamic Transfer Learning for Named Entity Recognition

Arxiv

5+阅读 · 2019年5月1日

Multi-Instance Learning for End-to-End Knowledge Base Question Answering

Multi-Instance Learning for End-to-End Knowledge Base Question Answering

Arxiv

4+阅读 · 2019年3月6日

A Survey on Deep Learning for Named Entity Recognition

A Survey on Deep Learning for Named Entity Recognition

Arxiv

73+阅读 · 2018年12月22日

A Survey of Learning Causality with Data: Problems and Methods

A Survey of Learning Causality with Data: Problems and Methods

Arxiv

19+阅读 · 2018年9月25日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

DeepFM: An End-to-End Wide & Deep Learning Framework for CTR Prediction

Arxiv

6+阅读 · 2018年4月12日

Deep Active Learning for Named Entity Recognition

Arxiv

15+阅读 · 2018年2月4日

MatchZoo: A Toolkit for Deep Text Matching

Arxiv

5+阅读 · 2017年7月23日

微信扫码咨询专知VIP会员