Deep reinforcement learning (DRL) is one of the most popular algorithms to realize an autonomous driving (AD) system. The key success factor of DRL is that it embraces the perception capability of deep neural networks which, however, have been proven vulnerable to Trojan attacks. Trojan attacks have been widely explored in supervised learning (SL) tasks (e.g., image classification), but rarely in sequential decision-making tasks solved by DRL. Hence, in this paper, we explore Trojan attacks on DRL for AD tasks. First, we propose a spatio-temporal DRL algorithm based on the recurrent neural network and attention mechanism to prove that capturing spatio-temporal traffic features is the key factor to the effectiveness and safety of a DRL-augment AD system. We then design a spatial-temporal Trojan attack on DRL policies, where the trigger is hidden in a sequence of spatial and temporal traffic features, rather than a single instant state used in existing Trojan on SL and DRL tasks. With our Trojan, the adversary acts as a surrounding normal vehicle and can trigger attacks via specific spatial-temporal driving behaviors, rather than physical or wireless access. Through extensive experiments, we show that while capturing spatio-temporal traffic features can improve the performance of DRL for different AD tasks, they suffer from Trojan attacks since our designed Trojan shows high stealthy (various spatio-temporal trigger patterns), effective (less than 3.1\% performance variance rate and more than 98.5\% attack success rate), and sustainable to existing advanced defenses.
翻译:深入强化学习(DRL)是实现自主驱动(AD)系统最受欢迎的算法之一。DRL的关键成功因素是它包含深神经网络的感知能力,但事实证明这些网络很容易受到Trojan袭击。Trojan袭击在监管的学习(SL)任务(例如图像分类)中得到了广泛的探索,但在DRL解决的顺序决策任务中却很少被探索。因此,我们在本文件中探讨了Trojan对DRL进行自动任务时对DRL进行的攻击。首先,我们建议基于经常性神经网络和关注机制,对深神经网络的感知能力进行spatio-时空通信功能,以证明捕捉Sasto-时空通信功能是DR-AD系统有效性和安全性能的关键因素。我们随后设计了对DRL政策进行空间-时空袭击,其触发因素隐藏在空间和时空通信的序列中,而不是在现有的TroVL和DRL任务中使用的一瞬状态(SL和DRL任务。我们的Trojan,其相对性动作行为作为一种围绕正常交通运行的有效行为的行为, 而不是通过特定的空间-rompreval 动作, 动作可以显示现有的性攻击,从空间-trade-trade-trade-trade-traal 动作,而可以显示不同的性攻击,而显示不同的性攻击,而显示不同的性攻击的性攻击的高度-trastal-trastal-traal-trastral-trastral-trastral-trastral-trastral-tra) 性攻击,而可以显示不同的性攻击的高度性能显示现有的性能过程。