利用具有高秩序地形学抽象学的图形强化学习,对多机器人集体运输进行高效规划</s> (Efficient Planning of Multi-Robot Collective Transport using Graph Reinforcement Learning with Higher Order Topological Abstraction)

Efficient multi-robot task allocation (MRTA) is fundamental to various time-sensitive applications such as disaster response, warehouse operations, and construction. This paper tackles a particular class of these problems that we call MRTA-collective transport or MRTA-CT -- here tasks present varying workloads and deadlines, and robots are subject to flight range, communication range, and payload constraints. For large instances of these problems involving 100s-1000's of tasks and 10s-100s of robots, traditional non-learning solvers are often time-inefficient, and emerging learning-based policies do not scale well to larger-sized problems without costly retraining. To address this gap, we use a recently proposed encoder-decoder graph neural network involving Capsule networks and multi-head attention mechanism, and innovatively add topological descriptors (TD) as new features to improve transferability to unseen problems of similar and larger size. Persistent homology is used to derive the TD, and proximal policy optimization is used to train our TD-augmented graph neural network. The resulting policy model compares favorably to state-of-the-art non-learning baselines while being much faster. The benefit of using TD is readily evident when scaling to test problems of size larger than those used in training.

翻译：高效的多机器人任务分配(MRTA)对于灾害应对、仓储作业和建筑等各种时间敏感应用至关重要。本文件处理的是我们称之为MRTTA-集体运输或MRTA-CT -- -- 这里的任务工作量和期限各不相同,机器人受到飞行范围、通信范围和有效载荷的限制。对于涉及100-1000的任务和10-100个机器人的任务和10-100个机器人的大型问题来说,传统的非学习解决方案往往缺乏时间效率,而新出现的基于学习的政策在不进行费用高昂的再培训的情况下不足以解决更大范围的问题。为了解决这一差距,我们使用最近提出的一个包含Capsule网络和多头关注机制的编码-解析图形神经网络,并创新地添加表层描述器(TD),作为提高向类似和较大规模的无形问题的可转移性的新特征。使用持久性同性同性分析法来得出TD- 和准性政策优化用于培训我们的TD- 图表神经网络。因此产生的政策模型模型比采用州级标准规模的测试问题要快得多。</s>