Time-series data are one of the fundamental types of raw data representation used in data-driven techniques. In machine condition monitoring, time-series vibration data are overly used in data mining for deep neural networks. Typically, vibration data is converted into images for classification using Deep Neural Networks (DNNs), and scalograms are the most effective form of image representation. However, the DNN classifiers require huge labeled training samples to reach their optimum performance. So, many forms of data augmentation techniques are applied to the classifiers to compensate for the lack of training samples. However, the scalograms are graphical representations where the existing augmentation techniques suffer because they either change the graphical meaning or have too much noise in the samples that change the physical meaning. In this study, a data augmentation technique named ensemble augmentation is proposed to overcome this limitation. This augmentation method uses the power of white noise added in ensembles to the original samples to generate real-like samples. After averaging the signal with ensembles, a new signal is obtained that contains the characteristics of the original signal. The parameters for the ensemble augmentation are validated using a simulated signal. The proposed method is evaluated using 10 class bearing vibration data using three state-of-the-art Transfer Learning (TL) models, namely, Inception-V3, MobileNet-V2, and ResNet50. Augmented samples are generated in two increments: the first increment generates the same number of fake samples as the training samples, and in the second increment, the number of samples is increased gradually. The outputs from the proposed method are compared with no augmentation, augmentations using deep convolution generative adversarial network (DCGAN), and several geometric transformation-based augmentations...
翻译:时间序列数据是数据驱动技术中使用的原始数据代表的基本类型之一。 在机器状况监测中, 时间序列振动数据被过度用于深度神经网络的数据挖掘中。 通常, 振动数据被转换成图像, 以便使用深神经网络进行分类, 而计算图则是图像表述中最有效的形式 。 然而, DNN 分类器需要大量的贴标签的培训样本才能达到最佳性能 。 因此, 数据增强技术的许多形式被应用到分类器中, 以弥补缺乏培训样本。 然而, 在机器状况监测中, 时间序列振动数据被过度用于深度神经网络的数据采集中, 现有的增强技术因改变图形含义或者在样本中噪音过多而改变物理含义而受到影响。 在这次研究中, 一种名为“ 共振动” 的数据增强技术, 在原始样本中添加了白色噪音的力量, 以产生真实的样本。 因此, 在将基于 编模的信号中, 获取了一个包含原始信号的新的信号。 在二号中, 高级增强技术的参数是使用模拟变压模型中,, 正在使用模拟的升级中, 递增变动数据 。