We are motivated by the problem of performing failure prediction for safety-critical robotic systems with high-dimensional sensor observations (e.g., vision). Given access to a blackbox control policy (e.g., in the form of a neural network) and a dataset of training environments, we present an approach for synthesizing a failure predictor with guaranteed bounds on false-positive and false-negative errors. In order to achieve this, we utilize techniques from Probably Approximately Correct (PAC)-Bayes generalization theory. In addition, we present novel class-conditional bounds that allow us to tradeoff the relative rates of false-positive vs. false-negative errors. We propose algorithms that train failure predictors (that take as input the history of sensor observations) by minimizing our theoretical error bounds. We demonstrate the resulting approach using extensive simulation and hardware experiments for vision-based navigation with a drone and grasping objects with a robotic manipulator equipped with a wrist-mounted RGB-D camera. These experiments illustrate the ability of our approach to (1) provide strong bounds on failure prediction error rates (that closely match empirical error rates), and (2) improve safety by predicting failures.
翻译:我们的动机是,对具有高维传感器观测(例如视觉)的安全临界机器人系统进行故障预测的问题。鉴于获得黑盒控制政策(例如以神经网络的形式)和训练环境数据集,我们提出了一个方法,将故障预测综合起来,对假阳性和假阴性误差进行有保证的界限。为了实现这一点,我们利用了可能大致正确(PAC)-贝耶斯一般化理论的技术。此外,我们提出了新的等级条件界限,使我们能够权衡假阳性相对于假阴性误差的相对比率。我们提出了通过尽量减少我们的理论错误界限来培训故障预测器(作为传感器观测历史的投入)的算法。我们用无人机进行广泛的模拟和硬件实验,并用配备手腕-GB-D照相机的机器人操纵器控制物体。这些实验表明,我们的方法能够(1) 提供故障预测误差率的严格界限(与实验误差率密切吻合),(2) 改进安全性,通过精确的预测率提高安全性。