We present the development of a semi-supervised regression method using variational autoencoders (VAE), which is customized for use in soft sensing applications. We motivate the use of semi-supervised learning considering the fact that process quality variables are not collected at the same frequency as other process variables leading to many unlabelled records in operational datasets. These unlabelled records are not possible to use for training quality variable predictions based on supervised learning methods. Use of VAEs for unsupervised learning is well established and recently they were used for regression applications based on variational inference procedures. We extend this approach of supervised VAEs for regression (SVAER) to make it learn from unlabelled data leading to semi-supervised VAEs for regression (SSVAER), then we make further modifications to their architecture using additional regularization components to make SSVAER well suited for learning from both labelled and unlabelled process data. The probabilistic regressor resulting from the variational approach makes it possible to estimate the variance of the predictions simultaneously, which provides an uncertainty quantification along with the generated predictions. We provide an extensive comparative study of SSVAER with other publicly available semi-supervised and supervised learning methods on two benchmark problems using fixed-size datasets, where we vary the percentage of labelled data available for training. In these experiments, SSVAER achieves the lowest test errors in 11 of the 20 studied cases, compared to other methods where the second best gets 4 lowest test errors out of the 20.
翻译:我们推出一种半监督回归法,使用变式自动读数器(VAE)进行半监督回归法,这是定制的,用于软感应用。我们鼓励使用半监督的学习方法,因为过程质量变量的收集频率不同于导致业务数据集中许多未加标签的记录的其他过程变量。这些未贴标签的记录无法用于培训基于受监督的学习方法的质量变量预测。使用无监督的SSSAE学习的VAE系统,已经很好地建立,最近,它们被用于基于变式推断程序的回归应用。我们推广了这种由监管的VAE系统(SVAER)对最低回归(SVAER)进行比较的方法,以便从未贴标签的数据中学习到半监督的VAE系统(SSVAER)回归(SSVAER),然后我们进一步修改它们的架构,使用额外的正规化组件,使SSVAER能够很好地从贴标签和未贴标签的流程数据中学习。由于变式方法,20种不稳定性递增法使得有可能同时估算预测的差异,这提供了一种不确定性的比较性 VAER(VER),同时进行初步的测试,在所研究的4号中,我们利用了其他的测试方法,我们用了其他的固定的测试方法进行了两次数据测试。