Data-driven models such as neural networks are being applied more and more to safety-critical applications, such as the modeling and control of cyber-physical systems. Despite the flexibility of the approach, there are still concerns about the safety of these models in this context, as well as the need for large amounts of potentially expensive data. In particular, when long-term predictions are needed or frequent measurements are not available, the open-loop stability of the model becomes important. However, it is difficult to make such guarantees for complex black-box models such as neural networks, and prior work has shown that model stability is indeed an issue. In this work, we consider an aluminum extraction process where measurements of the internal state of the reactor are time-consuming and expensive. We model the process using neural networks and investigate the role of including skip connections in the network architecture as well as using l1 regularization to induce sparse connection weights. We demonstrate that these measures can greatly improve both the accuracy and the stability of the models for datasets of varying sizes.
翻译:神经网络等数据驱动模型越来越多地被用于安全关键应用,例如网络物理系统的建模和控制。尽管这种方法具有灵活性,但仍然对这些模型的安全性感到关切,而且需要大量可能昂贵的数据。特别是,当需要长期预测或缺乏经常测量数据时,模型的开放环稳定性变得非常重要。然而,很难为神经网络等复杂的黑盒模型提供这种保障,而先前的工作表明模型稳定性确实是一个问题。在这项工作中,我们考虑的是一个铝提取过程,即对反应堆内部状态的测量耗费时间和昂贵。我们用神经网络来模拟这一过程,并调查在网络结构中包括跳过连接以及使用l1调节来诱发微小连接权重的作用。我们证明,这些措施可以大大提高不同尺寸数据集模型的准确性和稳定性。