Modern industrial facilities generate large volumes of raw sensor data during production process. This data is used to monitor and control the processes and can be analyzed to detect and predict process abnormalities. Typically, the data has to be annotated by experts to be further used in predictive modeling. Most of today's research is focusing on either unsupervised anomaly detection algorithms or supervised methods, that require manually annotated data. The studies are often done using process simulator generated data for a narrow class of events and proposed algorithms are rarely verified on publicly available datasets. In this paper, we propose a novel method SensorSCAN for unsupervised fault detection and diagnosis designed for industrial chemical sensor data. We demonstrate our model performance on two publicly available datasets based on the Tennessee Eastman Process with various fault types. Results show that our method significantly outperforms existing approaches (+0.2-0.3 TPR for a fixed FPR) and detects most of the process faults without the use of expert annotation. In addition, we performed experiments to show that our method is suitable for real-world applications where the number of fault types is not known in advance.
翻译:现代工业设施在生产过程中产生大量原始传感器数据。这些数据用于监测和控制过程,并可用于分析以检测和预测过程异常。通常,数据必须由专家附加说明,以便进一步用于预测模型。今天的大多数研究侧重于未经监督的异常检测算法或监督方法,需要人工附加说明数据。这些研究往往使用程序模拟器为一小类事件生成的数据进行,而拟议的算法很少在公开的数据集中进行核实。我们在本文件中提出了用于为工业化学传感器数据设计不受监督的故障检测和诊断的传感器SensorSCAN新方法。我们用两种基于田纳西东曼进程、各种类型故障的公开数据集展示我们的模型性能。结果显示,我们的方法大大超出现有方法(固定FPR+0.0.0.03 TPR),在不使用专家注解的情况下检测大部分过程错误。此外,我们进行了实验,以表明我们的方法适合在尚未事先知道过失类型数量的地方进行真实世界应用。