The widespread adoption of large language models (LLMs) across industries has increased the demand for high-quality, customizable outputs. However, traditional alignment methods typically require retraining large pretrained models, making it difficult to adapt and optimize LLMs quickly for diverse applications. To address this limitation, we propose a novel \textit{Residual Alignment Model} (\textit{RAM}) that formalizes the alignment process as a form of importance sampling. In this framework, the unaligned upstream model serves as the proposal distribution, while alignment is cast as a secondary sampling step driven by an autoregressive alignment module that estimates the importance weights. This design naturally decouples the alignment module from the target aligned model, improving flexibility and scalability. Building on this formulation, we derive an efficient sequence-level training strategy for the alignment module that operates independently of the proposal module. We further develop a resampling algorithm with iterative token-level decoding to address the first-token latency issue common to comparable methods. Experimental evaluations on two leading open-source LLMs across diverse tasks, including instruction following, domain adaptation, and preference optimization, demonstrate that our approach consistently outperforms baseline models.
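To make the importance-sampling view above concrete, a minimal sketch of the implied factorization is given below; the symbols $\pi_{\mathrm{up}}$, $\pi_{\mathrm{tgt}}$, and $r_\theta$ are illustrative placeholders rather than the paper's notation:
\begin{equation*}
\pi_{\mathrm{tgt}}(y \mid x) \;\propto\; \pi_{\mathrm{up}}(y \mid x)\, r_\theta(y \mid x),
\end{equation*}
where $\pi_{\mathrm{up}}$ denotes the unaligned upstream (proposal) model, $r_\theta$ denotes the autoregressive alignment module that estimates the importance weight of a candidate response $y$ to a prompt $x$, and sampling from the aligned target $\pi_{\mathrm{tgt}}$ amounts to drawing candidates from $\pi_{\mathrm{up}}$ and resampling them in proportion to $r_\theta$.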