Machine learning on tree data has mostly focused on trees as input. Much less research has investigated trees as output, as in molecule optimization for drug discovery or hint generation for intelligent tutoring systems. In this work, we propose a novel autoencoder approach, called recursive tree grammar autoencoder (RTG-AE), which encodes trees via a bottom-up parser and decodes trees via a tree grammar, both controlled by neural networks trained to minimize the variational autoencoder loss. The resulting encoding and decoding functions can then be employed in downstream tasks, such as optimization and time series prediction. RTG-AE combines variational autoencoders, grammatical knowledge, and recursive processing. Our key message is that combining all three components improves performance compared to combining only two of them. In particular, we show experimentally that our proposed method improves the autoencoding error, training time, and optimization score on four benchmark datasets compared to baselines from the literature.
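The encode/decode round trip described above can be sketched as follows. This is a minimal toy illustration, not the authors' implementation: the class and function names (`Tree`, `RTGAutoencoder`) are hypothetical, and the neural networks and variational loss are replaced by simple symbolic codes, only to show the bottom-up encoding and grammar-driven top-down decoding interface.

```python
# Hypothetical sketch of a tree autoencoder interface; names are
# illustrative assumptions, not the RTG-AE authors' API. Neural
# networks and the VAE loss are omitted in favor of symbolic codes.
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Tree:
    label: str
    children: List["Tree"] = field(default_factory=list)


class RTGAutoencoder:
    """Toy round trip: encode bottom-up, decode via grammar-like rules."""

    def encode(self, tree: Tree) -> Tuple:
        # Bottom-up: encode the children first, then combine their
        # codes with the parent label (a parser-like traversal).
        return (tree.label, tuple(self.encode(c) for c in tree.children))

    def decode(self, code: Tuple) -> Tree:
        # Top-down: expand the rule implied by the code, recursing
        # into the child codes (a grammar-like generation).
        label, child_codes = code
        return Tree(label, [self.decode(c) for c in child_codes])


ae = RTGAutoencoder()
t = Tree("+", [Tree("x"), Tree("*", [Tree("y"), Tree("2")])])
assert ae.decode(ae.encode(t)) == t  # lossless round trip in this toy version
```

In the actual approach, the encoder would map each subtree to a continuous latent vector and the decoder would select grammar rules from that vector, so the round trip is only approximate and is trained by minimizing the variational autoencoder loss.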