End-to-end symbolic regression with transformers - Zhuanzhi Papers


April 22, 2022

End-to-end symbolic regression with transformers


Pierre-Alexandre Kamienny, Stéphane d'Ascoli, Guillaume Lample, François Charton

Symbolic regression, the task of predicting the mathematical expression of a function from the observation of its values, is a difficult task which usually involves a two-step procedure: predicting the "skeleton" of the expression up to the choice of numerical constants, then fitting the constants by optimizing a non-convex loss function. The dominant approach is genetic programming, which evolves candidates by iterating this subroutine a large number of times. Neural networks have recently been tasked to predict the correct skeleton in a single try, but remain much less powerful. In this paper, we challenge this two-step procedure, and task a Transformer to directly predict the full mathematical expression, constants included. One can subsequently refine the predicted constants by feeding them to the non-convex optimizer as an informed initialization. We present ablations to show that this end-to-end approach yields better results, sometimes even without the refinement step. We evaluate our model on problems from the SRBench benchmark and show that our model approaches the performance of state-of-the-art genetic programming with several orders of magnitude faster inference.
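The refinement step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the skeleton `c0 * sin(c1 * x) + c2`, the synthetic data, the starting points, and the choice of SciPy's BFGS optimizer are all assumptions made for illustration. The sketch fits the constants of a fixed skeleton by minimizing a non-convex mean-squared error, once from an "informed" starting point (standing in for constants predicted by a model) and once from a distant one.

```python
import numpy as np
from scipy.optimize import minimize


def skeleton(x, c):
    """Hypothetical expression skeleton: c0 * sin(c1 * x) + c2."""
    return c[0] * np.sin(c[1] * x) + c[2]


def mse(c, x, y):
    """Mean-squared error of the skeleton; non-convex in the frequency c1."""
    return float(np.mean((skeleton(x, c) - y) ** 2))


def refine(c_init, x, y):
    """Fit the constants with BFGS, starting from c_init."""
    result = minimize(mse, np.asarray(c_init, dtype=float),
                      args=(x, y), method="BFGS")
    return result.x, float(result.fun)


# Synthetic, noise-free observations of an assumed ground-truth function.
x = np.linspace(-3.0, 3.0, 200)
true_c = np.array([2.0, 1.5, 0.5])
y = skeleton(x, true_c)

# Informed initialization: constants close to the truth, standing in for
# the constants a trained model might predict (an assumption of this sketch).
informed_init = true_c + np.array([0.1, -0.05, 0.1])
refined_c, refined_loss = refine(informed_init, x, y)

# Distant initialization: the optimizer may stop in a different local minimum.
distant_c, distant_loss = refine([1.0, 6.0, 0.0], x, y)

print("informed start:", refined_c, refined_loss)
print("distant start: ", distant_c, distant_loss)
```

Because the loss is non-convex in the frequency constant, the quality of the starting point can decide whether the optimizer recovers the true constants, which is the motivation the abstract gives for using predicted constants as the initialization.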


