Text-to-SQL parsers are crucial in enabling non-experts to effortlessly query relational data. Training such parsers, by contrast, generally requires expertise in annotating natural language (NL) utterances with corresponding SQL queries. In this work, we propose a weak supervision approach for training text-to-SQL parsers. We take advantage of the recently proposed question meaning representation called QDMR, an intermediate between NL and formal query languages. Given questions, their QDMR structures (annotated by non-experts or automatically predicted), and the answers, we are able to automatically synthesize SQL queries that are used to train text-to-SQL models. We test our approach by experimenting on five benchmark datasets. Our results show that the weakly supervised models perform competitively with those trained on annotated NL-SQL data. Overall, we effectively train text-to-SQL parsers, while using zero SQL annotations.
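To make the data-synthesis step concrete, the following sketch illustrates the answer-based filtering idea in Python: candidate SQL queries (which in our pipeline are derived from a question's QDMR structure; here they are hand-written stand-ins) are executed against the database, and a candidate is kept as a training pair only if its execution result matches the gold answer. The toy table, question, and helper names are illustrative assumptions, not the actual implementation.

```python
import sqlite3

def execution_matches(conn, sql, gold_answer):
    """Execute a candidate SQL query and compare its result set
    (as a multiset of rows) against the gold answer."""
    try:
        rows = conn.execute(sql).fetchall()
    except sqlite3.Error:
        return False  # discard candidates that fail to execute
    return sorted(rows) == sorted(gold_answer)

def synthesize_pair(question, candidate_sqls, gold_answer, conn):
    """Return (question, sql) for the first candidate whose execution
    result matches the gold answer, or None if no candidate does."""
    for sql in candidate_sqls:
        if execution_matches(conn, sql, gold_answer):
            return (question, sql)
    return None

if __name__ == "__main__":
    # Toy in-memory database standing in for a benchmark schema.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE city (name TEXT, country TEXT, population INTEGER)")
    conn.executemany(
        "INSERT INTO city VALUES (?, ?, ?)",
        [("Paris", "France", 2148000), ("Lyon", "France", 513000),
         ("Berlin", "Germany", 3645000)],
    )

    question = "How many cities in France have more than one million people?"
    # Candidates that, in the full pipeline, would be synthesized from the
    # question's QDMR decomposition; written by hand for this example.
    candidates = [
        "SELECT COUNT(*) FROM city WHERE country = 'France'",
        "SELECT COUNT(*) FROM city WHERE country = 'France' AND population > 1000000",
    ]
    gold_answer = [(1,)]  # the annotated answer to the question

    print(synthesize_pair(question, candidates, gold_answer, conn))
```

The resulting (question, SQL) pairs serve as weakly supervised training data for a text-to-SQL model, without any manually written SQL annotations.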