Cyber-physical systems (CPS) have been broadly deployed in safety-critical domains, such as automotive systems, avionics, medical devices, etc. In recent years, Artificial Intelligence (AI) has been increasingly adopted to control CPS. Despite the popularity of AI-enabled CPS, few benchmarks are publicly available. There is also a lack of deep understanding on the performance and reliability of AI-enabled CPS across different industrial domains. To bridge this gap, we initiate to create a public benchmark of industry-level CPS in seven domains and build AI controllers for them via state-of-the-art deep reinforcement learning (DRL) methods. Based on that, we further perform a systematic evaluation of these AI-enabled systems with their traditional counterparts to identify the current challenges and explore future opportunities. Our key findings include (1) AI controllers do not always outperform traditional controllers, (2) existing CPS testing techniques (falsification, specifically) fall short of analyzing AI-enabled CPS, and (3) building a hybrid system that strategically combines and switches between AI controllers and traditional controllers can achieve better performance across different domains. Our results highlight the need for new testing techniques for AI-enabled CPS and the need for more investigations into hybrid CPS systems to achieve optimal performance and reliability.
翻译:近些年来,人们越来越多地采用人工智能(AI)来控制计算机辅助装置。尽管由AI支持的计算机辅助装置受到欢迎,但很少有可公开使用的基准。对于由AI支持的计算机辅助装置在不同工业领域的业绩和可靠性也缺乏深刻了解。为了缩小这一差距,我们开始在七个领域建立工业级计算机辅助装置的公共基准,并通过最先进的深层强化学习(DRL)方法为这些系统建立AI控制器。在此基础上,我们进一步与传统对应方对这些由AI支持的系统进行系统评估,以查明当前的挑战和探索未来的机会。我们的主要结论包括:(1) AI控制器并非总是超越传统的控制器,(2) 现有的计算机辅助装置测试技术(falsization,具体来说)少于对由AI支持的计算机辅助装置的分析,以及(3) 建立一个混合系统,使AI控制器和传统控制器之间在战略上进行整合和交换,从而能够在不同的领域实现更好的性能。我们的成果突出表明,需要将新的性能测试技术转化为C-CPS。