While most neural generative models generate outputs in a single pass, the human creative process is usually one of iterative building and refinement. Recent work has proposed models of editing processes, but these mostly focus on editing sequential data and/or only model a single editing pass. In this paper, we present a generic model for incremental editing of structured data (i.e., "structural edits"). In particular, we focus on tree-structured data, taking abstract syntax trees of computer programs as our canonical example. Our editor learns to iteratively generate tree edits (e.g., deleting or adding a subtree) and applies them to the partially edited data, so that the entire editing process can be formulated as a sequence of consecutive, incremental tree transformations. To show the unique benefits of modeling tree edits directly, we further propose a novel edit encoder for learning to represent edits, as well as an imitation learning method that makes the editor more robust. We evaluate our proposed editor on two source code edit datasets, where results show that, with the proposed edit encoder, our editor significantly improves accuracy over previous approaches that generate the edited program directly in one pass. Finally, we demonstrate that training our editor to imitate experts and correct its mistakes dynamically can further improve its performance.
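To make the notion of "consecutive, incremental tree transformations" concrete, the following is a minimal sketch, not the paper's implementation: the Node, DeleteSubtree, AddSubtree, and apply_edits names are hypothetical, and only illustrate how a sequence of subtree-level edits could be applied one at a time to a partially edited abstract syntax tree.

```python
# Minimal, hypothetical sketch of applying structural edits to a toy AST.
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """A toy AST node with a label and ordered children."""
    label: str
    children: List["Node"] = field(default_factory=list)


@dataclass
class DeleteSubtree:
    """Edit that removes the child subtree at `child_index` of `parent`."""
    parent: Node
    child_index: int

    def apply(self, tree: Node) -> Node:
        del self.parent.children[self.child_index]
        return tree


@dataclass
class AddSubtree:
    """Edit that inserts `subtree` at `child_index` under `parent`."""
    parent: Node
    child_index: int
    subtree: Node

    def apply(self, tree: Node) -> Node:
        self.parent.children.insert(self.child_index, self.subtree)
        return tree


def apply_edits(tree: Node, edits) -> Node:
    # Each edit is applied to the partially edited tree, so the whole
    # process is a chain of incremental tree transformations.
    for edit in edits:
        tree = edit.apply(tree)
    return tree


# Example: rewrite `x = 1` into `x = 1 + y` by replacing the RHS subtree.
assign = Node("Assign", [Node("x"), Node("1")])
edits = [
    DeleteSubtree(assign, 1),
    AddSubtree(assign, 1, Node("BinOp+", [Node("1"), Node("y")])),
]
apply_edits(assign, edits)
assert assign.children[1].label == "BinOp+"
```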