The simplest way to obtain continuous interpolation between two points in high dimensional space is to draw a line between them. While previous works focused on the general connectivity between model parameters, we explored linear interpolation for parameters of pre-trained models after fine-tuning. Surprisingly, we could perform linear interpolation without a performance drop in intermediate points for fine-tuned models. For controllable text generation, such interpolation could be seen as moving a model towards or against the desired text attribute (e.g., positive sentiment), which could be used as grounds for further methods for controllable text generation without inference speed overhead.
翻译:获得高维空间两个点之间连续的内插的最简单方法是在两个点之间划线。虽然以前的工作侧重于模型参数之间的一般连通性,但我们探索了经过微调的经过训练的模型参数的线性内插性。令人惊讶的是,我们可以进行线性内插,而不会在微调模型的中间点上出现性能下降。 对于可控制的文本生成,这种内插性可被视为向或与预期的文本属性(例如正感)移动模式,而这种模式可以用作在不推断速度间接的情况下进一步采用可控文本生成方法的理由。