不确定性下多试剂追求 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - (Pursuer Assignment and Control Strategies in Multi-agent Pursuit-Evasion Under Uncertainties)

翻译：不确定性下多试剂追求 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Leiming Zhang,Amanda Prorok,Subhrajit Bhattacharya

from arxiv, 14 pages, 6 figures

We consider a pursuit-evasion problem with a heterogeneous team of multiple pursuers and multiple evaders. Although both the pursuers (robots) and the evaders are aware of each others' control and assignment strategies, they do not have exact information about the other type of agents' location or action. Using only noisy on-board sensors the pursuers (or evaders) make probabilistic estimation of positions of the evaders (or pursuers). Each type of agent use Markov localization to update the probability distribution of the other type. A search-based control strategy is developed for the pursuers that intrinsically takes the probability distribution of the evaders into account. Pursuers are assigned using an assignment algorithm that takes redundancy (i.e., an excess in the number of pursuers than the number of evaders) into account, such that the total or maximum estimated time to capture the evaders is minimized. In this respect we assume the pursuers to have clear advantage over the evaders. However, the objective of this work is to use assignment strategies that minimize the capture time. This assignment strategy is based on a modified Hungarian algorithm as well as a novel algorithm for determining assignment of redundant pursuers. The evaders, in order to effectively avoid the pursuers, predict the assignment based on their probabilistic knowledge of the pursuers and use a control strategy to actively move away from those pursues. Our experimental evaluation shows that the redundant assignment algorithm performs better than an alternative nearest-neighbor based assignment algorithm.

翻译：我们考虑的是由多个追逐者和多个逃避者组成的不同追逐团队的逃避追逐问题。虽然追逐者(罗博特人)和逃避者都了解对方的控制和派任策略,但他们并不掌握关于其他类型代理人所在地或行动的准确信息。追逐者(或逃避者)在机上传感器上只发出噪音,对逃避者(或追逐者)的位置进行概率估计。每种追逐者都使用Markov本地化来更新另一类追赶者的概率分布。为追踪者制定了一种基于搜索的控制策略,从本质上将逃避者的概率分布考虑在内。追逐者和逃避者被分配使用一种分配算法,而考虑到冗余(即追逐者人数多于逃避者人数),因此,追逐者(或逃避者)的总数或最大估计时间被最小化。在这方面,我们假定追赶者对逃避者有明确的优势。然而,这项工作的目的是使用分配战略来尽可能减少捕捉取时间。这项任务战略的基础是修改的匈牙利递解运者算法,以此作为一种新的递解算法,作为确定追逐者稳定性战略的新的递后,即追赶者采用追赶者的排序。