Controlled text generation is a very important task in the arena of natural language processing due to its promising applications. In order to achieve this task we mainly introduce the novel soft prompt tuning method of using soft prompts at both encoder and decoder levels together in a T5 model and investigate the performance as the behaviour of an additional soft prompt related to the decoder of a T5 model in controlled text generation remained unexplored. Then we also investigate the feasibility of steering the output of this extended soft prompted T5 model at decoder level and finally analyse the utility of generated text to be used in AI related tasks such as training AI models with an interpretability analysis of the classifier trained with synthetic text, as there is a lack of proper analysis of methodologies in generating properly labelled data to be utilized in AI tasks. Through the performed in-depth intrinsic and extrinsic evaluations of this generation model along with the artificially generated data, we found that this model produced better results compared to the T5 model with a single soft prompt at encoder level and the sentiment classifier trained using this artificially generated data can produce comparable classification results to the results of a classifier trained with real labelled data and also the classifier decision is interpretable with respect to the input text content.
翻译:为了完成这一任务,我们主要在T5模型中采用新型软快速调调方法,在编码器和解码器两级同时使用软提示,在编码器和解码器两个级别使用软提示,在T5模型中,由于在受控文本生成过程中与T5模型的解码器有关的额外软快速行为仍未进行探索,因此调查该功能的性能,因为与该T5模型解码器的解码器有关的另一个软快速行为还没有被探索。然后,我们还调查在解码器一级指导这一软驱动T5扩展模型的产出的可行性,并最终分析该软件生成的文本在AI相关任务中使用的效用,例如培训AI模型,对经过合成文本培训的分类器进行可解释性分析,因为缺乏对生成适当标签的数据用于AI任务的方法的适当分析。通过对这一生成的T5模型与人工生成的数据进行深入的内在和外部评估,我们发现该模型与T5模型相比产生更好的结果,在编码器一级使用单一的软提示,而使用这种人工生成的数据经过培训的情绪分类分类师能够产生可比的分类结果,同时对分类码式文件进行真正的翻译。