Duorat: 走向简单的文本到SQL模型 (DuoRAT: Towards Simpler Text-to-SQL Models)

Recent neural text-to-SQL models can effectively translate natural language questions to corresponding SQL queries on unseen databases. Working mostly on the Spider dataset, researchers have proposed increasingly sophisticated solutions to the problem. Contrary to this trend, in this paper we focus on simplifications. We begin by building DuoRAT, a re-implementation of the state-of-the-art RAT-SQL model that unlike RAT-SQL is using only relation-aware or vanilla transformers as the building blocks. We perform several ablation experiments using DuoRAT as the baseline model. Our experiments confirm the usefulness of some techniques and point out the redundancy of others, including structural SQL features and features that link the question with the schema.

翻译：最近的神经文本到 SQL 模型可以有效地将自然语言问题转化为对未知数据库的相应 SQL 查询。研究人员主要在蜘蛛数据集上提出越来越复杂的解决问题的办法。与这一趋势相反,我们在本文件中侧重于简化。我们首先建设Duorat,这是与RAT-SQL 不同的最新RAT-SQL 模型的重新应用,它与RAT-SQL 模型不同,它只使用有关系或香草变压器作为构件。我们用Duorat 作为基线模型,进行了数项通缩实验。我们的实验证实了某些技术的有用性,并指出了其他技术的冗余,包括SQL 结构特征和将问题与Schema联系起来的特征。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/