Rewrite systems [6, 10, 12] have been widely employing equality saturation [9], which is an optimisation methodology that uses a saturated e-graph to represent all possible sequences of rewrite simultaneously, and then extracts the optimal one. As such, optimal results can be achieved by avoiding the phase-ordering problem. However, we observe that when the e-graph is not saturated, it cannot represent all possible rewrite opportunities and therefore the phase-ordering problem is re-introduced during the construction phase of the e-graph. To address this problem, we propose MCTS-GEB, a domain-general rewrite system that applies reinforcement learning (RL) to e-graph construction. At its core, MCTS-GEB uses a Monte Carlo Tree Search (MCTS) [3] to efficiently plan for the optimal e-graph construction, and therefore it can effectively eliminate the phase-ordering problem at the construction phase and achieve better performance within a reasonable time. Evaluation in two different domains shows MCTS-GEB can outperform the state-of-the-art rewrite systems by up to 49x, while the optimisation can generally take less than an hour, indicating MCTS-GEB is a promising building block for the future generation of rewrite systems.
翻译:重写系统已广泛采用相等性饱和(equality saturation)进行优化(例如[6,10,12]),该方法使用饱和E图同时表示所有可能的重写序列,然后提取最佳重写序列。这样,通过避免阶段顺序问题,可以实现最佳结果。但是,我们观察到,当E图未饱和时,它无法表示所有可能的重写机会,因此在E图构建阶段重新引入了阶段顺序问题。为解决这个问题,我们提出了MCTS-GEB,一种将强化学习应用于E图构建的通用领域重写系统。在其核心,MCTS-GEB 使用蒙特卡罗树搜索(MCTS)[3]高效规划最佳E图构建,因此可以在构建阶段有效地消除阶段顺序问题,并在合理的时间内实现更好的性能。在两个不同的领域进行评估表明,MCTS-GEB 可以比最先进的重写系统优化高达49倍,而优化通常不超过一小时,表明MCTS-GEB 是未来重写系统的有前途的构建模块。