基于多级文本分类和生成传输学习的半监督半监督的半监督分解矢量定量化变异性自动电解码模型 (Entropy optimized semi-supervised decomposed vector-quantized variational autoencoder model based on transfer learning for multiclass text classification and generation)

2021 年 11 月 10 日

Entropy optimized semi-supervised decomposed vector-quantized variational autoencoder model based on transfer learning for multiclass text classification and generation

翻译：基于多级文本分类和生成传输学习的半监督半监督的半监督分解矢量定量化变异性自动电解码模型

Shivani Malhotra,Vinay Kumar,Alpana Agarwal

from arxiv, 12 pages, 4 figures

Semisupervised text classification has become a major focus of research over the past few years. Hitherto, most of the research has been based on supervised learning, but its main drawback is the unavailability of labeled data samples in practical applications. It is still a key challenge to train the deep generative models and learn comprehensive representations without supervision. Even though continuous latent variables are employed primarily in deep latent variable models, discrete latent variables, with their enhanced understandability and better compressed representations, are effectively used by researchers. In this paper, we propose a semisupervised discrete latent variable model for multi-class text classification and text generation. The proposed model employs the concept of transfer learning for training a quantized transformer model, which is able to learn competently using fewer labeled instances. The model applies decomposed vector quantization technique to overcome problems like posterior collapse and index collapse. Shannon entropy is used for the decomposed sub-encoders, on which a variable DropConnect is applied, to retain maximum information. Moreover, gradients of the Loss function are adaptively modified during backpropagation from decoder to encoder to enhance the performance of the model. Three conventional datasets of diversified range have been used for validating the proposed model on a variable number of labeled instances. Experimental results indicate that the proposed model has surpassed the state-of-the-art models remarkably.

翻译：近些年来,半监督的文本分类已成为研究的一个主要焦点。到目前为止,大多数研究都以监督学习为基础,但主要缺点在于没有在实际应用中提供标签数据样本。培训深度基因化模型和不经监督学习全面表达,这仍然是一个关键挑战。尽管连续的潜在变量主要用于深潜变异模型,但研究人员有效地使用了离散潜变异变量,其可理解性和压缩性更强。在本文中,我们提议为多级文本分类和文本生成建立一个半监督的离散潜在变异模型。拟议模型采用转移学习概念,用于培训一个量化变异模型,该模型能够使用较少标签实例进行称职的学习。模型应用分解矢量量化变异技术来克服后退和指数崩溃等问题。香农变潜变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变变