基于文本片段细粒度人类反馈的大语言模型微调方法 (Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans) - 专知论文

会员服务 ·

0

片段 · 细粒度 · 粒度 · 模型微调 · 微调 ·

Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans

翻译：基于文本片段细粒度人类反馈的大语言模型微调方法

Sky CH-Wang,Justin Svegliato,Helen Appel,Jason Eisner

We present a method and dataset for fine-tuning language models with preference supervision using feedback-driven improvement chains. Given a model response, an annotator provides fine-grained feedback by marking ``liked'' and ``disliked'' spans and specifying what they liked or disliked about them. The base model then rewrites the disliked spans accordingly, proceeding from left to right, forming a sequence of incremental improvements. We construct preference pairs for direct alignment from each adjacent step in the chain, enabling the model to learn from localized, targeted edits. We find that our approach outperforms direct alignment methods based on standard A/B preference ranking or full contrastive rewrites, demonstrating that structured, revision-based supervision leads to more efficient and effective preference tuning.

翻译：本文提出了一种利用反馈驱动改进链进行偏好监督的语言模型微调方法及相应数据集。给定模型生成的响应，标注者通过标记"认可"与"不认可"的文本片段并提供具体评价依据，实现细粒度反馈。基础模型据此从左至右依次重写不认可的片段，形成渐进式改进序列。我们通过链中相邻步骤构建直接对齐的偏好配对，使模型能够从局部化、目标明确的编辑中学习。实验表明，该方法在性能上优于基于标准A/B偏好排序或完整对比重写的直接对齐方法，证明结构化、基于修订的监督机制能实现更高效、更有效的偏好调优。

0

相关内容

图像反演：从生成对抗网络（GANs）到扩散模型及其未来发展综述

图像反演：从生成对抗网络（GANs）到扩散模型及其未来发展综述

专知会员服务

28+阅读 · 2月18日

【ICML2023】SEGA:结构熵引导的图对比学习锚视图

【ICML2023】SEGA:结构熵引导的图对比学习锚视图

专知会员服务

23+阅读 · 2023年5月10日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

专知会员服务

195+阅读 · 2020年5月31日

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

专知会员服务

21+阅读 · 2020年3月28日

【AAAI2021】自监督对应学习的对比转换

【AAAI2021】自监督对应学习的对比转换

专知

12+阅读 · 2020年12月11日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

论文笔记之attention mechanism专题1:SA-Net（CVPR 2018）

论文笔记之attention mechanism专题1:SA-Net（CVPR 2018）

统计学习与视觉计算组

16+阅读 · 2018年4月5日

【推荐系统论文笔记】DKN: 基于深度知识感知的新闻推荐网络（WWW2018 ）

【推荐系统论文笔记】DKN: 基于深度知识感知的新闻推荐网络（WWW2018 ）

专知

18+阅读 · 2018年4月2日

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

炼数成金订阅号

26+阅读 · 2017年7月10日

粗糙回归模型与算法研究

国家自然科学基金

8+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

结合图像块联合聚类加权和混合分类器的非对齐稀疏表示识别方法

国家自然科学基金

1+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

Poisson-Process Topic Model for Integrating Knowledge from Pre-trained Language Models

Arxiv

0+阅读 · 12月26日

Locally Repairable Convertible Codes: Improved Lower Bound and General Construction

Arxiv

0+阅读 · 12月25日

DFORD: Directional Feedback based Online Ordinal Regression Learning

Arxiv

0+阅读 · 12月22日

HATS: High-Accuracy Triple-Set Watermarking for Large Language Models

Arxiv

0+阅读 · 12月22日

Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection

Arxiv

0+阅读 · 12月22日

VIP会员

文章信息

相关主题

相关VIP内容

图像反演：从生成对抗网络（GANs）到扩散模型及其未来发展综述

图像反演：从生成对抗网络（GANs）到扩散模型及其未来发展综述

专知会员服务

28+阅读 · 2月18日

【ICML2023】SEGA:结构熵引导的图对比学习锚视图

【ICML2023】SEGA:结构熵引导的图对比学习锚视图

专知会员服务

23+阅读 · 2023年5月10日

【ICCV2021】参数化对比学习

专知会员服务

33+阅读 · 2021年7月27日

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

KG-BERT：基于BERT的知识图谱补全，KG-BERT: BERT for Knowledge Graph Completion

专知会员服务

195+阅读 · 2020年5月31日

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

专知会员服务

21+阅读 · 2020年3月28日

热门VIP内容

开通专知VIP会员享更多权益服务

《北约联合仿真与集成、验证与鉴定服务标准》2025最新40页

《面向协同任务的无人地面车辆与无人机（UGV-UAV）集成研究综述》2025最新综述论文

《理解大语言模型在军事战术任务规划中的局限性》

《国防与安全会议论文集》最新80页

相关资讯

【AAAI2021】自监督对应学习的对比转换

【AAAI2021】自监督对应学习的对比转换

专知

12+阅读 · 2020年12月11日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

论文笔记之attention mechanism专题1:SA-Net（CVPR 2018）

论文笔记之attention mechanism专题1:SA-Net（CVPR 2018）

统计学习与视觉计算组

16+阅读 · 2018年4月5日

【推荐系统论文笔记】DKN: 基于深度知识感知的新闻推荐网络（WWW2018 ）

【推荐系统论文笔记】DKN: 基于深度知识感知的新闻推荐网络（WWW2018 ）

专知

18+阅读 · 2018年4月2日

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

语义分割中的深度学习方法全解：从FCN、SegNet到DeepLab

炼数成金订阅号

26+阅读 · 2017年7月10日

相关论文

Poisson-Process Topic Model for Integrating Knowledge from Pre-trained Language Models

Arxiv

0+阅读 · 12月26日

Locally Repairable Convertible Codes: Improved Lower Bound and General Construction

Arxiv

0+阅读 · 12月25日

DFORD: Directional Feedback based Online Ordinal Regression Learning

Arxiv

0+阅读 · 12月22日

HATS: High-Accuracy Triple-Set Watermarking for Large Language Models

Arxiv

0+阅读 · 12月22日

Restrictive Hierarchical Semantic Segmentation for Stratified Tooth Layer Detection

Arxiv

0+阅读 · 12月22日

相关基金

粗糙回归模型与算法研究

国家自然科学基金

8+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

结合图像块联合聚类加权和混合分类器的非对齐稀疏表示识别方法

国家自然科学基金

1+阅读 · 2015年12月31日

Jacobi行列式和Hilbert变换中的若干问题及应用

国家自然科学基金

0+阅读 · 2014年12月31日

高维数据下的模型平均方法

国家自然科学基金

6+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员