Despite the success of Neural Combinatorial Optimization methods for end-to-end heuristic learning, out-of-distribution generalization remains a challenge. In this paper, we present a novel formulation of combinatorial optimization (CO) problems as Markov Decision Processes (MDPs) that effectively leverages symmetries of the CO problems to improve out-of-distribution robustness. Starting from the standard MDP formulation of constructive heuristics, we introduce a generic transformation based on bisimulation quotienting (BQ) in MDPs. This transformation allows us to reduce the state space by accounting for the intrinsic symmetries of the CO problem, and it facilitates solving the MDP. We illustrate our approach on the Traveling Salesman, Capacitated Vehicle Routing, and Knapsack Problems. We present a BQ reformulation of these problems and introduce a simple attention-based policy network that we train by imitation of (near-)optimal solutions to small instances from a single distribution. We obtain new state-of-the-art generalization results for instances with up to 1000 nodes from synthetic and realistic benchmarks that vary in both size and node distribution.
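To make the quotienting idea concrete, below is a minimal Python sketch (ours, not code from the paper) for the TSP case: in the standard constructive MDP the state is the full partial tour, whereas under bisimulation quotienting all partial tours that start at the same node, end at the same node, and leave the same set of cities unvisited collapse to one state. The class and function names (`BQStateTSP`, `bq_state`) are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class BQStateTSP:
    """Hypothetical quotiented TSP state: order of visited nodes is discarded."""
    origin: int                 # first node of the tour (needed to close it)
    current: int                # last visited node (the next step starts here)
    remaining: frozenset[int]   # set of unvisited nodes

def bq_state(partial_tour: list[int], n_nodes: int) -> BQStateTSP:
    """Quotient map: collapse a partial tour to its BQ equivalence class."""
    return BQStateTSP(
        origin=partial_tour[0],
        current=partial_tour[-1],
        remaining=frozenset(range(n_nodes)) - set(partial_tour),
    )

# Two different partial tours over 6 nodes map to the same quotiented state,
# since they share origin 0, current node 4, and remaining nodes {3, 5}:
assert bq_state([0, 2, 1, 4], 6) == bq_state([0, 1, 2, 4], 6)
```

Because a policy defined on such quotiented states cannot distinguish bisimilar partial solutions, it is forced to respect the problem's symmetry; this is the intuition behind the improved out-of-distribution robustness claimed in the abstract.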