In Machine Learning (ML)-enabled autonomous systems (MLASs), it is essential to identify the hazard boundary of ML Components (MLCs) in the MLAS under analysis. Given that such boundary captures the conditions in terms of MLC behavior and system context that can lead to hazards, it can then be used to, for example, build a safety monitor that can take any predefined fallback mechanisms at runtime when reaching the hazard boundary. However, determining such hazard boundary for an ML component is challenging. This is due to the space combining system contexts (i.e., scenarios) and MLC behaviors (i.e., inputs and outputs) being far too large for exhaustive exploration and even to handle using conventional metaheuristics, such as genetic algorithms. Additionally, the high computational cost of simulations required to determine any MLAS safety violations makes the problem even more challenging. Furthermore, it is unrealistic to consider a region in the problem space deterministically safe or unsafe due to the uncontrollable parameters in simulations and the non-linear behaviors of ML models (e.g., deep neural networks) in the MLAS under analysis. To address the challenges, we propose MLCSHE (ML Component Safety Hazard Envelope), a novel method based on a Cooperative Co-Evolutionary Algorithm (CCEA), which aims to tackle a high-dimensional problem by decomposing it into two lower-dimensional search subproblems. Moreover, we take a probabilistic view of safe and unsafe regions and define a novel fitness function to measure the distance from the probabilistic hazard boundary and thus drive the search effectively. We evaluate the effectiveness and efficiency of MLCSHE on a complex Autonomous Vehicle (AV) case study. Our evaluation results show that MLCSHE is significantly more effective and efficient compared to a standard genetic algorithm and random search.
翻译:在机器学习(ML)带动的自动系统(MLAS)中,必须查明正在分析的MALS中ML构件(MLCs)的危险界限;鉴于这种边界能够捕捉到刚果解放运动行为和系统环境中可能导致危险的条件,因此可以用来例如建立一个安全监测器,在到达危险边界时,可以在运行时采用任何预先定义的后退机制;然而,为ML构件确定这种危险界限具有挑战性。这是因为系统环境(即,情景)和刚果解放运动行为(即,投入和产出)的结合空间太大,无法进行详尽的探索,甚至无法使用传统的计量仪(如遗传算法)处理。此外,确定任何MALS安全违规行为所需的高计算成本使得问题更加具有挑战性。我们不现实地认为问题所在区域具有威慑性的安全性或不安全性,因为模拟中无法控制的因素以及ML模型的非线性行为(eg.,深度的直线性投入和产出)太大,无法进行彻底的勘探,甚至更精确地分析,因此,MLALS(O-MLS)的高级搜索和稳定数据分析中的一项数据分析需要提出一个更高的数据分析。