Controllable text generation concerns two fundamental tasks of wide applications, namely generating text of given attributes (i.e., attribute-conditional generation), and minimally editing existing text to possess desired attributes (i.e., text attribute transfer). Extensive prior work has largely studied the two problems separately, and developed different conditional models which, however, are prone to producing biased text (e.g., various gender stereotypes). This paper proposes to formulate controllable text generation from a principled causal perspective which models the two tasks with a unified framework. A direct advantage of the causal formulation is the use of rich causality tools to mitigate generation biases and improve control. We treat the two tasks as interventional and counterfactual causal inference based on a structural causal model, respectively. We then apply the framework to the challenging practical setting where confounding factors (that induce spurious correlations) are observable only on a small fraction of data. Experiments show significant superiority of the causal approach over previous conditional models for improved control accuracy and reduced bias.
翻译:可控文本生成涉及广泛应用的两个基本任务,即生成特定属性的文本(即属性条件生成),以及对现有文本进行最低限度的编辑,以拥有理想属性(即文本属性转移) 。先前的广泛工作主要分别研究了这两个问题,并开发了不同的有条件模型,但容易产生偏颇文本(例如,各种性别陈规定型观念) 本文建议从原则性因果角度制定可控文本生成,以统一框架为这两项任务的模型。因果表述的直接好处是使用丰富的因果关系工具来减少生成偏差并改进控制。我们分别将这两项任务视为基于结构性因果模型的干预性和反事实因果关系推断。我们随后将该框架应用于具有挑战性的实际环境,在这种环境中,只有一小部分数据可以观察到混杂因素(诱发虚假关联)。实验显示,因果关系方法比先前的有条件模型具有显著优势,以提高控制准确性和减少偏向性。