Parameter-efficient methods are able to use a single frozen pre-trained large language model (LLM) to perform many tasks by learning task-specific soft prompts that modulate model behavior when concatenated to the input text. However, these learned prompts are tightly coupled to a given frozen model -- if the model is updated, corresponding new prompts need to be obtained. In this work, we propose and investigate several approaches to ``Prompt Recycling'', where a prompt trained on a source model is transformed to work with a new target model. Our methods do not rely on supervised pairs of prompts, task-specific data, or training updates with the target model, which would be just as costly as re-tuning prompts with the target model from scratch. We show that recycling between models is possible (our best settings are able to successfully recycle $88.9\%$ of prompts, producing a prompt that outperforms baselines), but significant performance headroom remains, requiring improved recycling techniques.
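To make the setting concrete, the sketch below shows one simple way a recycler satisfying the constraints above could be built: fit a linear map between the two models' vocabulary embedding tables and apply it to the tuned soft prompt, with no task data and no training updates on the target model. This is an illustrative sketch under those assumptions, not necessarily the exact recipe studied in the paper; all names (\texttt{E\_src}, \texttt{E\_tgt}, \texttt{prompt\_src}) are hypothetical.

\begin{verbatim}
# Hedged sketch: recycle a soft prompt from a source model to a target
# model by fitting a linear map between their vocabulary embedding
# tables (shared vocabulary assumed). Illustrative only; variable names
# are hypothetical.
import numpy as np

def fit_vocab_map(E_src: np.ndarray, E_tgt: np.ndarray) -> np.ndarray:
    """Least-squares linear map W such that E_src @ W ~= E_tgt.

    E_src: [vocab, d_src] source-model token embeddings.
    E_tgt: [vocab, d_tgt] target-model token embeddings.
    """
    W, *_ = np.linalg.lstsq(E_src, E_tgt, rcond=None)
    return W  # [d_src, d_tgt]

def recycle_prompt(prompt_src: np.ndarray, W: np.ndarray) -> np.ndarray:
    """Project a tuned soft prompt [prompt_len, d_src] into target space."""
    return prompt_src @ W  # [prompt_len, d_tgt]

# Usage: the recycled prompt is concatenated to the target model's input
# embeddings at inference time, with no further training.
\end{verbatim}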