标题：使用全参和基于LoRA的微调方法在中文指令数据上进行指令跟随大语言模型的比较研究摘要：近年来，大语言模型的指令微调是自然语言处理领域的一个关键研究领域。由于资源和成本限制，一些研究人员采用了参数有效的微调技术，如LoRA，进行指令微调，并取得了鼓舞人心的结果。与全参数微调相比，基于LoRA的微调在培训成本方面显示出明显的优势。在本研究中，我们对全参数微调和基于LoRA的微调方法进行了实验比较，利用LLaMA作为基础模型。实验结果表明，选择基础模型、训练数据集规模、可学习参数数量和模型训练成本都是重要因素。我们希望本文的实验结论能为大语言模型的培训提供启示，特别是在中文领域，帮助研究人员找到更好的培训成本和模型性能之间的平衡策略。为了便于重现本文的结果，我们将发布数据集，模型和代码。 (A Comparative Study between Full-Parameter and LoRA-based Fine-Tuning on Chinese Instruction Data for Instruction Following Large Language Model)

翻译：标题：使用全参和基于LoRA的微调方法在中文指令数据上进行指令跟随大语言模型的比较研究摘要：近年来，大语言模型的指令微调是自然语言处理领域的一个关键研究领域。由于资源和成本限制，一些研究人员采用了参数有效的微调技术，如LoRA，进行指令微调，并取得了鼓舞人心的结果。与全参数微调相比，基于LoRA的微调在培训成本方面显示出明显的优势。在本研究中，我们对全参数微调和基于LoRA的微调方法进行了实验比较，利用LLaMA作为基础模型。实验结果表明，选择基础模型、训练数据集规模、可学习参数数量和模型训练成本都是重要因素。我们希望本文的实验结论能为大语言模型的培训提供启示，特别是在中文领域，帮助研究人员找到更好的培训成本和模型性能之间的平衡策略。为了便于重现本文的结果，我们将发布数据集，模型和代码。

Xianghui Sun,Yunjie Ji,Baochang Ma,Xiangang Li

Recently, the instruction-tuning of large language models is a crucial area of research in the field of natural language processing. Due to resource and cost limitations, several researchers have employed parameter-efficient tuning techniques, such as LoRA, for instruction tuning, and have obtained encouraging results In comparison to full-parameter fine-tuning, LoRA-based tuning demonstrates salient benefits in terms of training costs. In this study, we undertook experimental comparisons between full-parameter fine-tuning and LoRA-based tuning methods, utilizing LLaMA as the base model. The experimental results show that the selection of the foundational model, training dataset scale, learnable parameter quantity, and model training cost are all important factors. We hope that the experimental conclusions of this paper can provide inspiration for training large language models, especially in the field of Chinese, and help researchers find a better trade-off strategy between training cost and model performance. To facilitate the reproduction of the paper's results, the dataset, model and code will be released.

翻译：使用全参数和基于LoRA的微调在指令跟随大语言模型上进行了比较研究，在中文领域获得了实验结果。研究者们发现，基础模型、训练数据集规模、可学习参数数量和模型训练成本等因素对微调效果产生了重要影响。而对于指令跟随，基于LoRA的微调方法在培训成本方面具有更好的效果，相比之下使用全参微调更为昂贵。研究者希望这些实验结论能给语言模型的培训提供更好的指导，帮助发现成本和性能之间的平衡策略。为了便于他人重现实验结果，数据集、模型和代码将公开发布。