We propose a novel Parallel Monte Carlo tree search with Batched Simulations (PMBS) algorithm for accelerating long-horizon, episodic robotic planning tasks. Monte Carlo tree search (MCTS) is an effective heuristic search algorithm for solving episodic decision-making problems whose underlying search spaces are expansive. Leveraging a GPU-based large-scale simulator, PMBS introduces massive parallelism into MCTS for solving planning tasks through the batched execution of a large number of concurrent simulations, which allows for more efficient and accurate evaluations of the expected cost-to-go over large action spaces. When applied to the challenging manipulation tasks of object retrieval from clutter, PMBS achieves a speedup of over $30\times$ with an improved solution quality, in comparison to a serial MCTS implementation. We show that PMBS can be directly applied to real robot hardware with negligible sim-to-real differences. Supplementary material, including video, can be found at https://github.com/arc-l/pmbs.
翻译:我们建议采用小说《平行的蒙特卡洛树》搜索平行的蒙塔卡洛树搜索,使用Batched模拟(PMBS)算法来加快长视距,即偶发的机器人规划任务。蒙特卡洛树搜索(MCTS)是一种有效的超理论搜索算法,用于解决其基本搜索空间很广的偶发决策问题。利用基于GPU的大型模拟器,PMBS将大规模平行主义引入MCTS,通过分批进行大量同时模拟来解决规划任务,从而能够更有效和更准确地评估大型行动空间的预期成本到成本。在应用到具有挑战性的拼接物体回收操作任务时,PMBS比连续的 MCTS 执行质量提高了30美元。我们显示,PBS可以直接应用于实际的机器人硬件,但微小的Sim-to-真实差异。包括视频在内的补充材料可以在https://github.com/arc-l/pmbs找到。