Demand for high-performance, robust, and safe autonomous systems has grown substantially in recent years. These objectives motivate the desire for efficient safety-theoretic reasoning that can be embedded in core decision-making tasks such as motion planning, particularly in constrained environments. On one hand, Monte-Carlo (MC) and other sampling-based techniques provide accurate collision probability estimates for a wide variety of motion models but are cumbersome in the context of continuous optimization. On the other, "direct" approximations aim to compute (or upper-bound) the failure probability as a smooth function of the decision variables, and thus are convenient for optimization. However, existing direct approaches fundamentally assume discrete-time dynamics and can perform unpredictably when applied to continuous-time systems ubiquitous in the real world, often manifesting as severe conservatism. State-of-the-art attempts to address this within a conventional discrete-time framework require additional Gaussianity approximations that ultimately produce inconsistency of their own. In this paper we take a fundamentally different approach, deriving a risk approximation framework directly in continuous time and producing a lightweight estimate that actually converges as the underlying discretization is refined. Our approximation is shown to significantly outperform state-of-the-art techniques in replicating the MC estimate while maintaining the functional and computational benefits of a direct method. This enables robust, risk-aware, continuous motion-planning for a broad class of nonlinear and/or partially-observable systems.
翻译:近些年来,对高性能、强健和安全自主系统的需求大幅增长。这些目标激发了对高效安全理论推理的渴望,这种推理可以植根于核心决策任务,例如运动规划,特别是在受限制的环境中。一方面,蒙特卡洛(MC)和其他取样技术为各种各样的运动模型提供了准确的碰撞概率估计,但在连续优化方面却十分繁琐。另一方面,“直接”近比旨在将失败概率作为决定变量的顺利功能进行计算(或上限),从而便于优化。然而,现有的直接方法从根本上承担了离散时间动态,在应用到现实世界中无处不在的连续时间系统时,可以不可预测地执行。蒙特卡洛(MC)和其他基于取样的技术往往表现为严重的保守主义。在传统的离散时间框架内解决这一问题,需要额外的高压近似性近似,最终造成其自身的不一致。在本文件中,我们采取了一种根本上不同的方法,直接在连续的时间里得出一个风险近似框架,并产生一种较轻的重量估计,当适用于现实世界中无处的系统时,这种精确的估算,而功能性推算法则显示离散的精确的精确的精确的计算方法是精确的精确的精确的调整。