We study the problem of decomposing a complex text-to-SQL task into smaller sub-tasks and how such a decomposition can significantly improve the performance of Large Language Models (LLMs) in the reasoning process. There is currently a considerable gap between the performance of fine-tuned models and that of prompting approaches using LLMs on challenging text-to-SQL datasets such as Spider. We show that SQL queries, despite their declarative structure, can be broken down into sub-problems, and that the solutions of those sub-problems can be fed into LLMs to significantly improve their performance. Our experiments with three LLMs show that this approach consistently improves their performance by roughly 10%, pushing their accuracy towards the state of the art and even outperforming large fine-tuned models on the Spider holdout test set.
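To make the idea concrete, here is a minimal sketch of decomposed prompting for text-to-SQL. It is an illustration of the general pattern described above, not the paper's exact pipeline: the particular sub-tasks (schema linking, query-structure planning) and the call_llm wrapper are assumptions introduced for this example.

```python
def call_llm(prompt: str) -> str:
    """Hypothetical placeholder for an LLM completion call (any API client)."""
    raise NotImplementedError("wire up an LLM client here")


def text_to_sql(question: str, schema: str) -> str:
    # Sub-task 1: identify the tables and columns the question refers to.
    links = call_llm(
        f"Schema:\n{schema}\n\nQuestion: {question}\n"
        "List the tables and columns needed to answer the question."
    )
    # Sub-task 2: determine the query's structure (joins, nesting, aggregation).
    plan = call_llm(
        f"Question: {question}\nRelevant schema items: {links}\n"
        "Describe the query structure: which joins, aggregations, "
        "or nested sub-queries are required?"
    )
    # Final step: feed the sub-problem solutions back into the prompt, so the
    # model generates SQL with the harder reasoning steps already resolved.
    return call_llm(
        f"Schema:\n{schema}\nQuestion: {question}\n"
        f"Relevant items: {links}\nQuery plan: {plan}\n"
        "Write the SQL query."
    )
```

The design point is that each intermediate answer becomes context for the next prompt, rather than asking the model to produce the full query in one step.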