Constraint programming is known as an efficient approach for solving combinatorial problems. Important design choices in a solver are the branching heuristics, which are designed to lead the search to the best solutions in a minimum amount of time. However, developing these heuristics is a time-consuming process that requires problem-specific expertise. This observation has motivated many efforts to use machine learning to automatically learn efficient heuristics without expert intervention. To the best of our knowledge, this remains an open research question. Although several generic variable-selection heuristics are available in the literature, the options for a generic value-selection heuristic are much scarcer. In this paper, we propose to tackle this issue by introducing a generic learning procedure that can be used to obtain a value-selection heuristic inside a constraint programming solver. This is achieved through the combination of a deep Q-learning algorithm, a tailored reward signal, and a heterogeneous graph neural network architecture. Experiments on graph coloring, maximum independent set, and maximum cut problems show that our framework is able to find solutions close to optimality without requiring a large number of backtracks, while remaining generic.