We consider the problem of security-aware planning in an unknown stochastic environment, in the presence of attacks on control signals (i.e., actuators) of the robot. We model the attacker as an agent who has the full knowledge of the controller as well as the employed intrusion-detection system and who wants to prevent the controller from performing tasks while staying stealthy. We formulate the problem as a stochastic game between the attacker and the controller and present an approach to express the objective of such an agent and the controller as a combined linear temporal logic (LTL) formula. We then show that the planning problem, described formally as the problem of satisfying an LTL formula in a stochastic game, can be solved via model-free reinforcement learning when the environment is completely unknown. Finally, we illustrate and evaluate our methods on two robotic planning case studies.
翻译:在机器人控制信号(即动画师)受到攻击的情况下,我们考虑在未知的随机环境中的安全意识规划问题,我们把攻击者模拟为完全了解控制器和已启用的入侵探测系统的代理人,并想阻止控制器在隐形时执行任务的代理人。我们把这个问题描述为攻击者与控制器之间的一种随机游戏,提出一种方法,将这种代理人和控制器的目标表述为一种合并的线性时间逻辑(LTL)公式。我们然后表明,在环境完全未知时,可以通过无型强化学习来解决正式称为在随机游戏中满足LTL公式的问题的规划问题。最后,我们用两种机器人规划案例研究来说明和评估我们的方法。