Partially-observable problems pose a trade-off between reducing costs and gathering information. They can be solved optimally by planning in belief space, but that is often prohibitively expensive. Model-predictive control (MPC) takes the alternative approach of using a state estimator to form a belief over the state, and then plan in state space. This ignores potential future observations during planning and, as a result, cannot actively increase or preserve the certainty of its own state estimate. We find a middle-ground between planning in belief space and completely ignoring its dynamics by only reasoning about its future accuracy. Our approach, filter-aware MPC, penalises the loss of information by what we call "trackability", the expected error of the state estimator. We show that model-based simulation allows condensing trackability into a neural network, which allows fast planning. In experiments involving visual navigation, realistic every-day environments and a two-link robot arm, we show that filter-aware MPC vastly improves regular MPC.
翻译:局部可观察问题存在降低成本和收集信息之间的折衷。通过在信仰空间内进行规划可以最优地解决这些问题,但这通常代价很高。模型预测控制(MPC)采取了另一种方法,使用状态估计器形成状态信仰,然后在状态空间内进行规划。这在规划期间忽略了潜在的未来观测结果,因此无法主动增加或保持其自身状态估计的确定性。我们在在信仰空间内进行规划和完全忽略其动态之间找到了一个折中点,只考虑其未来准确性。我们的方法- 过滤器感知MPC,通过我们所称的“可追踪性”(状态估计器的期望误差)惩罚信息丢失。我们证明了模型模拟可以将可追踪性缩减为神经网络,从而实现快速的规划。在涉及到视觉导航、逼真的日常环境和两节链接机械臂的实验中,我们展示了Filter-Aware MPC极大地改善了常规MPC。