Learning-based methods have shown promising performance for accelerating motion planning, but mostly in the setting of static environments. For the more challenging problem of planning in dynamic environments, such as multi-arm assembly tasks and human-robot interaction, motion planners need to consider the trajectories of the dynamic obstacles and reason about temporal-spatial interactions in very large state spaces. We propose a GNN-based approach that uses temporal encoding and imitation learning with data aggregation for learning both the embeddings and the edge prioritization policies. Experiments show that the proposed methods can significantly accelerate online planning over state-of-the-art complete dynamic planning algorithms. The learned models can often reduce costly collision checking operations by more than 1000x, and thus accelerating planning by up to 95%, while achieving high success rates on hard instances as well.
翻译:以学习为基础的方法在加速运动规划方面表现出了有希望的绩效,但大多是在静态环境中。对于在动态环境中进行规划这一更具挑战性的问题,如多武器组装任务和人-机器人互动,运动规划者需要考虑在非常大的州空间中时间空间互动的动态障碍和原因的轨迹。我们建议采用基于全球网络的方法,利用时间编码和模拟学习与数据汇总相结合的方法,学习嵌入和边缘优先排序政策。实验表明,拟议的方法可以大大加快对最先进的全动态规划算法的在线规划。 所学的模型往往可以将成本高昂的碰撞检查操作减少1000倍以上,从而加速规划高达95%,同时在困难情况下也实现高成功率。