在不确定性下利用分布式比值利用分布式比值避免反应碰撞:核心嵌入办法 (Leveraging Distributional Bias for Reactive Collision Avoidance under Uncertainty: A Kernel Embedding Approach)

Many commodity sensors that measure the robot and dynamic obstacle's state have non-Gaussian noise characteristics. Yet, many current approaches treat the underlying-uncertainty in motion and perception as Gaussian, primarily to ensure computational tractability. On the other hand, existing planners working with non-Gaussian uncertainty do not shed light on leveraging distributional characteristics of motion and perception noise, such as bias for efficient collision avoidance. This paper fills this gap by interpreting reactive collision avoidance as a distribution matching problem between the collision constraint violations and Dirac Delta distribution. To ensure fast reactivity in the planner, we embed each distribution in Reproducing Kernel Hilbert Space and reformulate the distribution matching as minimizing the Maximum Mean Discrepancy (MMD) between the two distributions. We show that evaluating the MMD for a given control input boils down to just matrix-matrix products. We leverage this insight to develop a simple control sampling approach for reactive collision avoidance with dynamic and uncertain obstacles. We advance the state-of-the-art in two respects. First, we conduct an extensive empirical study to show that our planner can infer distributional bias from sample-level information. Consequently, it uses this insight to guide the robot to good homotopy. We also highlight how a Gaussian approximation of the underlying uncertainty can lose the bias estimate and guide the robot to unfavorable states with a high collision probability. Second, we show tangible comparative advantages of the proposed distribution matching approach for collision avoidance with previous non-parametric and Gaussian approximated methods of reactive collision avoidance.

翻译：测量机器人和动态障碍状态的许多商品传感器都具有非Gaussian噪音特性。然而,许多现行办法将基本不确定性和感知的不确定性作为Gausian处理,主要是为了确保计算性。另一方面,与非Gaussian不确定性合作的现有规划者并没有说明如何利用运动和感知噪音的分布特性,例如对有效避免碰撞的偏差。本文弥补了这一差距,将反应性避免碰撞解释为碰撞限制违约和Dirac Delta分布之间的分配匹配问题。为确保规划者迅速重新活跃,我们将每次分布都嵌入Renel Cernel Hilbert空间,并将分布匹配重新定位为最大限度地减少两种分布之间的最大平均值差异。我们表明,为某种特定控制输入而评价MMD的分布特性,会把控制性输入压缩到仅仅是矩阵式产品。我们利用这一洞察力来发展一种简单的控制抽样方法,用动态和不确定的障碍来避免反应性碰撞。我们从两个方面推进了目前的状况。首先,我们进行了一项广泛的实验性研究,以显示我们的计划在高比值的准确性分析中能显示我们是如何用高的概率分析。