Successful robotic operation in stochastic environments relies on accurate characterization of the underlying probability distributions, yet this is often imperfect due to limited knowledge. This work presents a control algorithm that is capable of handling such distributional mismatches. Specifically, we propose a novel nonlinear MPC for distributionally robust control, which plans locally optimal feedback policies against a worst-case distribution within a given KL divergence bound from a Gaussian distribution. Leveraging mathematical equivalence between distributionally robust control and risk-sensitive optimal control, our framework also provides an algorithm to dynamically adjust the risk-sensitivity level online for risk-sensitive control. The benefits of the distributional robustness as well as the automatic risk-sensitivity adjustment are demonstrated in a dynamic collision avoidance scenario where the predictive distribution of human motion is erroneous.
翻译:在随机环境中成功的机器人操作依赖于对潜在概率分布的准确描述,但由于知识有限,这往往不完美。这项工作提出了一个控制算法,能够处理这种分布不匹配。具体地说,我们提议采用一种新的非线性MPC来进行分配稳健控制,根据与高斯分布相约束的某个特定 KL 差异中最坏的分布情况,规划当地最佳反馈政策。利用分布稳健的控制和风险敏感度最佳控制之间的数学等值,我们的框架还提供一种算法来动态调整在线风险敏感度,以进行风险敏感控制。分布稳健性的好处以及自动风险敏感度调整在动态避免碰撞的假设中表现出来,在这种假设中,人类运动的预测分布是错误的。