In recent years, camera-based 3D object detection has gained widespread attention for its ability to achieve high performance with low computational cost. However, the robustness of these methods to adversarial attacks has not been thoroughly examined. In this study, we conduct the first comprehensive investigation of the robustness of leading camera-based 3D object detection methods under various adversarial conditions. Our experiments reveal five interesting findings: (a) the use of accurate depth estimation effectively improves robustness; (b) depth-estimation-free approaches do not show superior robustness; (c) bird's-eye-view-based representations exhibit greater robustness against localization attacks; (d) incorporating multi-frame benign inputs can effectively mitigate adversarial attacks; and (e) addressing long-tail problems can enhance robustness. We hope our work can provide guidance for the design of future camera-based object detection modules with improved adversarial robustness.
翻译:近年来,基于摄像的三维物体探测因其以低计算成本实现高性能的能力而得到广泛关注,然而,这些对抗性攻击方法的稳健性尚未得到彻底研究;在本研究中,我们对各种对抗性条件下基于摄像的三维物体探测主要方法的稳健性进行了第一次全面调查;我们的实验揭示了五个有趣的发现:(a) 准确深度估计的使用有效地提高了稳健性;(b) 无深度估计方法并不显示强健性;(c) 鸟类视视面显示对地方化攻击的稳健性较强;(d) 纳入多框架良性投入可以有效减轻对抗性攻击;(e) 解决长尾问题可以加强稳健性;我们希望我们的工作能够为设计基于摄影性物体探测模块提供指导,提高对抗性强性。