Embodied agents in vision navigation coupled with deep neural networks have attracted increasing attention. However, deep neural networks are vulnerable to malicious adversarial noises, which may potentially cause catastrophic failures in Embodied Vision Navigation. Among these adversarial noises, universal adversarial perturbations (UAP), i.e., the image-agnostic perturbation applied on each frame received by the agent, are more critical for Embodied Vision Navigation since they are computation-efficient and application-practical during the attack. However, existing UAP methods do not consider the system dynamics of Embodied Vision Navigation. For extending UAP in the sequential decision setting, we formulate the disturbed environment under the universal noise $\delta$, as a $\delta$-disturbed Markov Decision Process ($\delta$-MDP). Based on the formulation, we analyze the properties of $\delta$-MDP and propose two novel Consistent Attack methods for attacking Embodied agents, which first consider the dynamic of the MDP by estimating the disturbed Q function and the disturbed distribution. In spite of victim models, our Consistent Attack can cause a significant drop in the performance for the Goalpoint task in habitat. Extensive experimental results indicate that there exist potential risks for applying Embodied Vision Navigation methods to the real world.
翻译:然而,深神经网络容易受到恶意对抗性噪音的侵扰,这可能会在Embidi Vision导航中造成灾难性的失败。在这些对抗性噪音中,普遍对抗性扰动(UAP),即对代理人收到的每个框架应用的图像-不可知性扰动(UAP),对于Embidi Vision导航更为关键,因为它们在攻击期间是计算效率高的和应用实用的,但是,现有的UAP方法并不考虑Embidio Vision导航的系统动态。为了在连续决策中扩大UAP,我们根据普遍噪音($\delta$)来设计混乱的环境,作为美元-dropped Markov 决策过程($\delta$-MDP)。根据这一提法,我们分析了$delta-MDP的特性,并提出了两种全新的攻击Empidive-PDP的一致攻击方法,这些方法首先考虑MDP的动态,通过估计扰动的Q功能和扰动性分布在顺序决策中。我们根据普遍噪音($delta$-dribbed Markov marit Margium)来设计一个全球视野模型。