Despite recent, independent progress in model-based reinforcement learning and integrated symbolic-geometric robotic planning, synthesizing these techniques remains challenging because of their disparate assumptions and strengths. In this work, we take a step toward bridging this gap with Neuro-Symbolic Relational Transition Models (NSRTs), a novel class of transition models that are data-efficient to learn, compatible with powerful robotic planning methods, and generalizable over objects. NSRTs have both symbolic and neural components, enabling a bilevel planning scheme where symbolic AI planning in an outer loop guides continuous planning with neural models in an inner loop. Experiments in four robotic planning domains show that NSRTs can be learned after only tens or hundreds of training episodes, and then used for fast planning in new tasks that require up to 60 actions to reach the goal and involve many more objects than were seen during training. Video: https://tinyurl.com/chitnis-nsrts
翻译:尽管最近在基于模型的强化学习和综合象征性地球测量机器人规划方面取得了独立进展,但综合这些技术仍因其不同的假设和长处而具有挑战性。在这项工作中,我们迈出了一步,以弥补与Neuro-Symblic 关系过渡模型(NSRTs)(NSRTs)的这一差距。 Neuro-Symblical 关系过渡模型(NSRTs)是一个新型的过渡模型,它具有数据效率,可以学习,与强大的机器人规划方法相兼容,而且可以普遍适用。NSRTs具有象征和神经两个部分,使得双级规划计划能够使外部循环中象征性的AI规划能够指导内循环中神经模型的连续规划。四个机器人规划领域的实验表明,在经过数万或数百次培训之后,可以学习NSRTs,然后用于快速规划新任务,这些新任务需要多达60项行动才能实现目标,而且涉及比培训期间看到的更多的物体。视频:https://tinyurl.com/chitnis-norts。