OpenAssistant 会话 -- 民主化大型语言模型对齐 (OpenAssistant Conversations -- Democratizing Large Language Model Alignment)

Andreas Köpf,Yannic Kilcher,Dimitri von Rütte,Sotiris Anagnostidis,Zhi-Rui Tam,Keith Stevens,Abdullah Barhoum,Nguyen Minh Duc,Oliver Stanley,Richárd Nagyfi,Shahul ES,Sameer Suri,David Glushkov,Arnav Dantuluri,Andrew Maguire,Christoph Schuhmann,Huu Nguyen,Alexander Mattick

from arxiv, 44 pages, 39 figures

Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like RLHF rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages distributed across 66,497 conversation trees, in 35 different languages, annotated with 461,292 quality ratings. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. To demonstrate the OpenAssistant Conversations dataset's effectiveness, we present OpenAssistant, the first fully open-source large-scale instruction-tuned model to be trained on human data. A preference study revealed that OpenAssistant replies are comparably preferred to GPT-3.5-turbo (ChatGPT) with a relative winrate of 48.3% vs. 51.7% respectively. We release our code and data under fully permissive licenses.

翻译：对齐大型语言模型 (LLM) 与人类偏好的方法已经被证明可以极大地提高可用性，并已在 ChatGPT 中得到广泛应用。监督微调 (SFT) 和基于人类反馈的强化学习 (RLHF) 等对齐技术大大降低了有效利用 LLM 能力所需的技能和领域知识，增强了在各种领域的可访问性和实用性。然而，最先进的 RLHF 对齐技术依赖于高质量的人类反馈数据，这种数据的创建成本高，并且通常是专有的。为了民主化大规模对齐研究，我们发布了 OpenAssistant 会话数据集，这是一个包含 161,443 条消息、分布在 66,497 个对话树中的人工生成、人工注释的助手式对话语料库，使用 35 种不同的语言进行注释，共有 461,292 个质量评分。该语料库是一个全球性的众包活动的成果，涉及到超过 13,500 名志愿者。为了展示 OpenAssistant 会话数据集的有效性，我们提出了 OpenAssistant，这是第一个完全基于公开数据集训练的大规模指令调整模型。一项偏好研究表明，OpenAssistant 的回复与 GPT-3.5-涡轮 (ChatGPT) 相比具有可比性，相对胜率为 48.3% 对 51.7%。我们以完全开放的许可证发布我们的代码和数据。