Formality style transfer is the task of converting informal sentences to grammatically-correct formal sentences, which can be used to improve performance of many downstream NLP tasks. In this work, we propose a semi-supervised formality style transfer model that utilizes a language model-based discriminator to maximize the likelihood of the output sentence being formal, which allows us to use maximization of token-level conditional probabilities for training. We further propose to maximize mutual information between source and target styles as our training objective instead of maximizing the regular likelihood that often leads to repetitive and trivial generated responses. Experiments showed that our model outperformed previous state-of-the-art baselines significantly in terms of both automated metrics and human judgement. We further generalized our model to unsupervised text style transfer task, and achieved significant improvements on two benchmark sentiment style transfer datasets.
翻译:形式风格的转移是将非正式句子转换成文法上更正的正式句子的任务,可以用来改进下游国家劳工局许多下游任务的绩效。在这项工作中,我们建议采用半监督的手续风格转移模式模式,使用基于语言的模式歧视者,最大限度地提高产出句形式化的可能性,从而使我们能够在培训中尽量利用象征性水平有条件的概率。我们进一步提议最大限度地利用源与目标风格之间的相互信息,以此作为我们的培训目标,而不是尽量扩大经常导致重复和微不足道的反应的可能性。实验表明,我们的模型在自动计量和人类判断方面都大大超过以往最先进的基线。我们进一步推广了我们的模式,将不受监督的文本风格转移任务推广到不受监督的文本风格转移任务中,并在两个基准情绪风格传输数据集上取得了重大改进。