Fault diagnosis of dynamic systems is done by detecting changes in time-series data, for example residuals, caused by system degradation and faulty components. The use of general-purpose multi-class classification methods for fault diagnosis is complicated by imbalanced training data and unknown fault classes. Another complicating factor is that different fault classes can result in similar residual outputs, especially for small faults, which causes classification ambiguities. In this work, a framework for data-driven analysis and open-set classification is developed for fault diagnosis applications using the Kullback-Leibler divergence. A data-driven fault classification algorithm is proposed which can handle imbalanced datasets, class overlapping, and unknown faults. In addition, an algorithm is proposed to estimate the size of the fault when training data contains information from known fault realizations. An advantage of the proposed framework is that it can also be used for quantitative analysis of fault diagnosis performance, for example, to analyze how easy it is to classify faults of different magnitudes. To evaluate the usefulness of the proposed methods, multiple datasets from different fault scenarios have been collected from an internal combustion engine test bench to illustrate the design process of a data-driven diagnosis system, including quantitative fault diagnosis analysis and evaluation of the developed open set fault classification algorithm.
翻译:对动态系统的错误诊断是通过探测时间序列数据的变化,例如系统退化和缺陷组成部分造成的剩余数据。使用通用多级分类方法进行故障诊断,由于培训数据不平衡和未知的故障类别而复杂。另一个复杂因素是,不同的故障类别可能导致类似的剩余产出,特别是小故障,从而造成分类模糊。在这项工作中,为使用 Kullback-Leibler 差异的错误诊断应用开发了一个数据驱动分析和开放分类框架。提出了一种数据驱动的故障分类算法,可以处理不平衡的数据集、分类重叠和未知的错误。此外,还提出了一种算法,用以估计培训数据包含已知错误认识的信息时的过错大小。拟议框架的一个优点是,它也可以用于对错误诊断性能进行定量分析,例如,分析如何容易对不同程度的错误进行分类。为了评估拟议方法的效用,从内部燃烧引擎测试台收集了不同故障情景的多个数据集,用以说明数据驱动分析系统的设计过程,包括定量分析。