We present a graph neural network to learn graph coloring heuristics using reinforcement learning. Our learned deterministic heuristics give better solutions than classical degree-based greedy heuristics and only take seconds to evaluate on graphs with tens of thousands of vertices. As our approach is based on policy-gradients, it also learns a probabilistic policy as well. These probabilistic policies outperform all greedy coloring baselines and a machine learning baseline. Our approach generalizes several previous machine-learning frameworks, which applied to problems like minimum vertex cover. We also demonstrate that our approach outperforms two greedy heuristics on minimum vertex cover.
翻译:我们展示了一个图形神经网络,用强化学习来学习图形色素。我们所学的确定性理论提供了比古老的基于学位的贪婪黄金主义更好的解决方案,而仅仅花几秒钟来评估带有数万顶脊椎的图表。由于我们的方法以政策等级为基础,它也学习了一种概率政策。这些概率政策超过了所有贪婪的颜色基线和机器学习基线。我们的方法概括了以前应用于最低顶层覆盖等问题的机器学习框架。我们还表明,我们的方法在最低顶层覆盖上超过了两种贪婪的黄金主义。