While artificial-intelligence-based methods suffer from lack of transparency, rule-based methods dominate in safety-critical systems. Yet, the latter cannot compete with the first ones in robustness to multiple requirements, for instance, simultaneously addressing safety, comfort, and efficiency. Hence, to benefit from both methods they must be joined in a single system. This paper proposes a decision making and control framework, which profits from advantages of both the rule- and machine-learning-based techniques while compensating for their disadvantages. The proposed method embodies two controllers operating in parallel, called Safety and Learned. A rule-based switching logic selects one of the actions transmitted from both controllers. The Safety controller is prioritized every time, when the Learned one does not meet the safety constraint, and also directly participates in the safe Learned controller training. Decision making and control in autonomous driving is chosen as the system case study, where an autonomous vehicle learns a multi-task policy to safely cross an unprotected intersection. Multiple requirements (i.e., safety, efficiency, and comfort) are set for vehicle operation. A numerical simulation is performed for the proposed framework validation, where its ability to satisfy the requirements and robustness to changing environment is successfully demonstrated.
翻译:虽然以人为智能为基础的方法缺乏透明度,但以规则为基础的方法在安全关键系统中占主导地位,但后者不能与第一个方法竞争,因为前者在稳健性方面符合多种要求,例如,同时处理安全、舒适和效率,因此,要从这两种方法中获益,就必须将两者合并为一个单一系统。本文件提议了一个决策和控制框架,从基于规则的和基于机器的学习技术的优势中获益,同时补偿其缺点。拟议方法包含两个平行运作的控制器,称为安全和熟练。基于规则的转换逻辑从两个控制器中选择一个动作。安全控制器每次优先排序,当一个执行者不满足安全限制,也直接参与安全操作器的培训。选择了自主驾驶的决策和控制作为系统案例研究,让一个自主车辆学习多任务政策,安全地跨越不受保护的交叉点。为车辆操作设定了多种要求(即安全、效率和舒适)。为拟议的框架验证工作进行了数字模拟,其满足要求的能力和健全环境变化的能力是成功的。