The success of current machine learning on image-based combustion monitoring is based on massive data, which is costly even impossible for industrial applications. To address this conflict, we introduce few-shot learning to combustion monitoring for the first time. Two algorithms, Siamese Network coupled with k Nearest Neighbors (SN-kNN) and Prototypical Network (PN), are attempted. Besides, rather than purely utilizing visual images as previous studies, we also attempt Infrared (IR) images. In this work, we analyze the training process, test performance and inference speed of two algorithms on both image formats, and also use t-SNE to visualize learned features. The results demonstrate that both SN-kNN and PN are capable to distinguish flame states from learning with 20 images per flame state. The worst performance, which is realized by combination of PN and IR images, still possesses precision, accuracy, recall, and F1-score all above 0.95. Through observing images and visualizing features, we realize that visual images have more dramatic differences between classes and have more consistent patterns inside the class, which makes the training speed and model performance on visual images is better. In contrast, the relatively "low-quality" IR images makes PN hard to extract distinguishable prototypes, which causes the relative weak performance, but with the whole training set to support classification, SN-kNN cooperates well with IR images. On the other hand, benefited from the architecture design, PN has a much faster speed in training and inference than SN-kNN. The work here analyzes the characteristics of both algorithms and image formats for the first time, which provides the guidance for further utilizing them in combustion monitoring tasks.
翻译:目前机器在基于图像的燃烧监测上的成功学习基于大量数据,对于工业应用来说,这种数据甚至成本昂贵,甚至不可能实现。为了解决这一冲突,我们首次引入燃烧监测的微小学习。尝试了两种算法,即Siamese 网络和K Nearest 邻居网络(SN-kNNN)和Protodmid 网络(PN)。此外,我们不仅没有像以往的研究那样纯粹使用图像,还尝试红外线(IR)图像。在这项工作中,我们分析两个图像格式的训练过程、测试性能和推断速度,并使用T-SNEE 来视觉特征。结果显示,SN-kNN和PN能够将火焰状态与每个火焰状态的20个图像(SN-kNNN)和Protocl 网络(PNW)的学习区别开来。最差的性能是PNP和IR图像的组合,仍然拥有精确、准确、回顾和F1 - 核心全部高于0.95的图像。我们意识到视觉图像的差别,我们意识到视觉图像在课堂上的差别和更加一致的图型模式上,使得培训速度和SNNNL的精确的精确和模型的精度的精确和模型的精确性能能比比。