机器人对抗情境安全测试：比较和最佳攻击性的研究 (On the Adversarial Scenario-based Safety Testing of Robots: the Comparability and Optimal Aggressiveness)

This paper studies the class of scenario-based safety testing algorithms in the black-box safety testing configuration. For algorithms sharing the same state-action set coverage with different sampling distributions, it is commonly believed that prioritizing the exploration of high-risk state-actions leads to a better sampling efficiency. Our proposal disputes the above intuition by introducing an impossibility theorem that provably shows all safety testing algorithms of the aforementioned difference perform equally well with the same expected sampling efficiency. Moreover, for testing algorithms covering different sets of state-actions, the sampling efficiency criterion is no longer applicable as different algorithms do not necessarily converge to the same termination condition. We then propose a testing aggressiveness definition based on the almost safe set concept along with an unbiased and efficient algorithm that compares the aggressiveness between testing algorithms. Empirical observations from the safety testing of bipedal locomotion controllers and vehicle decision-making modules are also presented to support the proposed theoretical implications and methodologies.

翻译：本文研究了黑盒安全测试中的一类情境安全测试算法。针对具有相同状态-动作覆盖率但不同采样分布的算法，通常认为优先探索高风险状态-动作可以提高采样效率。本文提出一项不可能定理，可以证明上述直觉是错误的，并且证明了这些差异的安全测试算法在期望的采样效率相同情况下会表现出相同的性能。此外，针对涵盖不同状态 - 动作集合的测试算法，采样效率准则不再适用，因为不同算法不一定会收敛到相同的终止条件。因此，我们提出了一种基于几乎安全集概念的测试攻击策略定义，以及一种无偏和高效的算法，用于比较测试算法之间的攻击性。此外，还提供了使用该理论和方法的案例分析，包括对于两足步行控制器和车辆决策模块的安全测试的实验数据。