In many domains, such as transportation and logistics, search and rescue, or cooperative surveillance, tasks must be allocated while accounting for uncertainty in their execution. Existing task coordination algorithms either ignore the stochastic process or are computationally intensive. Exploiting the weakly coupled structure of the problem and the opportunity to coordinate in advance, we propose a decentralized auction-based coordination strategy built on a newly formulated score function, which is derived by modeling the problem as task-constrained Markov decision processes (MDPs). The proposed method guarantees convergence and, provided the reward function is submodular, a solution worth at least 50% of the optimum. Furthermore, for large-scale applications, we also propose an approximate variant, Deep Auction, which uses neural networks to avoid the burden of explicitly constructing the MDPs. Inspired by the well-known actor-critic architecture, it uses two Transformers to map observations to action probabilities and to cumulative rewards, respectively. Finally, we demonstrate the performance of the two proposed approaches in the context of drone deliveries, where stochastic planning for the drone fleet is cast as a stochastic prize-collecting Vehicle Routing Problem (VRP) with time windows. Simulation results are compared with state-of-the-art methods in terms of solution quality, planning efficiency, and scalability.
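To make the auction idea concrete, the following is a minimal sketch of greedy, bid-based task allocation under a submodular reward. It is not the score function described above, which comes from task-constrained MDPs: here each agent's bid is simply the marginal gain of a toy reward, and the auction runs in a single loop rather than in a decentralized fashion. The names `greedy_auction`, `reward`, and `WEIGHTS`, along with all numeric values, are hypothetical and chosen only for illustration.

```python
import math
from typing import Callable, Dict, FrozenSet, List, Optional, Tuple

# Hypothetical per-agent task weights (illustrative values only).
WEIGHTS: Dict[int, Dict[str, float]] = {
    0: {"A": 9.0, "B": 4.0, "C": 1.0},
    1: {"A": 1.0, "B": 5.0, "C": 9.0},
}


def reward(agent: int, bundle: FrozenSet[str]) -> float:
    """Toy submodular reward: a concave function (sqrt) of a modular sum."""
    return math.sqrt(sum(WEIGHTS[agent][t] for t in bundle))


def greedy_auction(
    agents: List[int],
    tasks: List[str],
    score: Callable[[int, FrozenSet[str]], float],
) -> Dict[int, FrozenSet[str]]:
    """Repeatedly award one task to the agent with the highest marginal-gain bid."""
    bundles: Dict[int, FrozenSet[str]] = {a: frozenset() for a in agents}
    unassigned = set(tasks)
    while unassigned:
        best: Optional[Tuple[float, int, str]] = None  # (bid, agent, task)
        for a in agents:
            base = score(a, bundles[a])
            for t in sorted(unassigned):
                bid = score(a, bundles[a] | {t}) - base  # marginal gain
                if best is None or bid > best[0]:
                    best = (bid, a, t)
        assert best is not None
        bid, winner, task = best
        if bid <= 0:  # no remaining task improves any agent's reward
            break
        bundles[winner] = bundles[winner] | {task}
        unassigned.remove(task)
    return bundles


allocation = greedy_auction([0, 1], ["A", "B", "C"], reward)
total = sum(reward(a, b) for a, b in allocation.items())
```

Because the toy reward has diminishing returns, an agent's bid for a task shrinks as its bundle grows, which is the property that the 50% guarantee for greedy allocation relies on.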