Cyber-physical systems (CPS) incorporate the complex and large-scale engineered systems behind critical infrastructure operations, such as water distribution networks, energy delivery systems, healthcare services, manufacturing systems, and transportation networks. Industrial CPS in particular need to simultaneously satisfy requirements of available, secure, safe and reliable system operation against diverse threats, in an adaptive and sustainable way. These adverse events can be of accidental or malicious nature and may include natural disasters, hardware or software faults, cyberattacks, or even infrastructure design and implementation faults. They may drastically affect the results of CPS algorithms and mechanisms, and subsequently the operations of industrial control systems (ICS) deployed in those critical infrastructures. Such a demanding combination of properties and threats calls for resilience-enhancement methodologies and techniques, working in real-time operation. However, the analysis of CPS resilience is a difficult task as it involves evaluation of various interdependent layers with heterogeneous computing equipment, physical components, network technologies, and data analytics. In this paper, we apply the principles of chaos engineering (CE) to industrial CPS, in order to demonstrate the benefits of such practices on system resilience. The systemic uncertainty of adverse events can be tamed by applying runtime CE-based analyses to CPS in production, in order to predict environment changes and thus apply mitigation measures limiting the range and severity of the event, and minimizing its blast radius.
翻译:这些不利事件可能具有意外或恶意性质,可能包括自然灾害、硬件或软件故障、网络攻击,甚至基础设施设计和实施失误;它们可能极大地影响中央采购事务处的算法和机制的结果,以及随后在这些关键基础设施中部署的工业控制系统(工业控制系统)的运作;这种要求很高的地产和威胁组合要求采用增强复原力的方法和技术,实时操作;然而,对中央采购事务处的复原力分析是一项艰巨的任务,因为它涉及评价各种相互依存的层层,包括混杂的计算设备、物理部件、网络技术和数据分析;在本文件中,我们对工业中央采购事务处采用混乱工程原理,以展示这种做法对系统复原力的好处;对中央采购事务处的系统性和稳定性,对中央采购事务处的系统性不确定性进行系统性分析,从而限制中央采购司的稳定性。