以人类反馈推动开放域聊天室 (Towards Boosting the Open-Domain Chatbot with Human Feedback) - 专知论文

会员服务 ·

0

Boosting（一种模型训练加速方式） · Chatbot · 任务对话系统 · INTERACT · Performer ·

2022 年 8 月 30 日

Towards Boosting the Open-Domain Chatbot with Human Feedback

翻译：以人类反馈推动开放域聊天室

Hua Lu,Siqi Bao,Huang He,Fan Wang,Hua Wu,Haifeng Wang

from arxiv, First two authors contributed equally to this work

Many open-domain dialogue models pre-trained with social media comments can generate coherent replies but have difficulties producing engaging responses when interacting with real users. This phenomenon might mainly result from the deficiency of annotated human-human conversations and the misalignment with human preference. In this paper, we propose a novel and efficient approach Diamante to boost the open-domain chatbot, where two kinds of human feedback (including explicit demonstration and implicit preference) are collected and leveraged. By asking annotators to select or amend the model-generated candidate responses, Diamante efficiently collects the human demonstrated responses and constructs a Chinese chit-chat dataset. To enhance the alignment with human preference, Diamante leverages the implicit preference in the data collection process and introduces the generation-evaluation joint training. Comprehensive experiments indicate that the Diamante dataset and joint training paradigm can significantly boost the performance of Chinese pre-trained dialogue models.

翻译：在经过社交媒体评论培训之前,许多开放式对话模式在与实际用户互动时可以产生一致的答复,但很难产生有吸引力的反应。这种现象可能主要是由于缺乏附加说明的人类对话,与人类偏好不吻合。在本文中,我们提出一种新颖而有效的Diamante方法,以提升开放的Diamante聊天室,其中收集和利用两种类型的人类反馈(包括明确的演示和隐含的偏好)。Diamante通过要求通知者选择或修改模型产生的候选回应,有效地收集了人类展示的反应,并构建了中国的chit聊天数据集。为了更好地与人类偏好保持一致, Diamante利用数据收集过程中的隐含偏好,并引入了一代评价联合培训。全面实验表明, Diamante数据集和联合培训模式可以极大地提升中国预先培训的对话模式的性能。

0

相关内容

Boosting（一种模型训练加速方式）

Boosting（一种模型训练加速方式）

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

专知

12+阅读 · 2018年3月15日

肿瘤间充质干细胞通过CCL22影响非小细胞肺癌化疗敏感性的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

脉冲磁致振荡促进固液界面前沿异质形核机理

国家自然科学基金

0+阅读 · 2015年12月31日

基于刀具变形和工件亚表层质量预测模型的自由曲面超精密铣削加工运动规划

国家自然科学基金

0+阅读 · 2014年12月31日

基于CS算法的数字信号压缩和高效数字系统设计的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ni3Al基金属间化合物多尺度本构模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

发光二极管LED非相干宽带腔增强吸收光谱技术对大气HONO的定量方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

McMullen函数族及其推广的动力系统

国家自然科学基金

0+阅读 · 2011年12月31日

低层错能镍基变形高温合金反常动态应变时效机理

国家自然科学基金

0+阅读 · 2011年12月31日

基于介电响应技术的油纸绝缘状态评估方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

战略联盟内组织间跨边界知识共享研究

国家自然科学基金

0+阅读 · 2008年12月31日

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

Arxiv

0+阅读 · 2022年10月18日

CrossRE: A Cross-Domain Dataset for Relation Extraction

Arxiv

0+阅读 · 2022年10月17日

Robust Imitation of a Few Demonstrations with a Backwards Model

Arxiv

0+阅读 · 2022年10月17日

Towards Cognitive Robots That People Accept in Their Home

Arxiv

0+阅读 · 2022年10月17日

Practical Benefits of Feature Feedback Under Distribution Shift

Arxiv

0+阅读 · 2022年10月17日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

Towards a Human-like Open-Domain Chatbot

Arxiv

14+阅读 · 2020年1月27日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

VIP会员

文章信息

相关主题

Boosting（一种模型训练加速方式）

任务对话系统

相关VIP内容

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】行动，规划与学习，622页pdf

美军坦克部队反无人机新策略：主炮轰击方案

【ICML2025】免费的Fisher？通过回收平方梯度累加器近似Fisher信息矩阵

数据质量维度的实践展开：一项综述

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

【论文推荐】最新7篇聊天机器人（Chatbot）相关论文—触动你的心、DeepProbe、饮食推荐、知识学习、交互、挑战、管理

专知

12+阅读 · 2018年3月15日

相关论文

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

Arxiv

0+阅读 · 2022年10月18日

CrossRE: A Cross-Domain Dataset for Relation Extraction

Arxiv

0+阅读 · 2022年10月17日

Robust Imitation of a Few Demonstrations with a Backwards Model

Arxiv

0+阅读 · 2022年10月17日

Towards Cognitive Robots That People Accept in Their Home

Arxiv

0+阅读 · 2022年10月17日

Practical Benefits of Feature Feedback Under Distribution Shift

Arxiv

0+阅读 · 2022年10月17日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey of Human-in-the-loop for Machine Learning

Arxiv

35+阅读 · 2021年8月2日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

Towards a Human-like Open-Domain Chatbot

Arxiv

14+阅读 · 2020年1月27日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

相关基金

肿瘤间充质干细胞通过CCL22影响非小细胞肺癌化疗敏感性的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

脉冲磁致振荡促进固液界面前沿异质形核机理

国家自然科学基金

0+阅读 · 2015年12月31日

基于刀具变形和工件亚表层质量预测模型的自由曲面超精密铣削加工运动规划

国家自然科学基金

0+阅读 · 2014年12月31日

基于CS算法的数字信号压缩和高效数字系统设计的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Ni3Al基金属间化合物多尺度本构模型研究

国家自然科学基金

0+阅读 · 2012年12月31日

发光二极管LED非相干宽带腔增强吸收光谱技术对大气HONO的定量方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

McMullen函数族及其推广的动力系统

国家自然科学基金

0+阅读 · 2011年12月31日

低层错能镍基变形高温合金反常动态应变时效机理

国家自然科学基金

0+阅读 · 2011年12月31日

基于介电响应技术的油纸绝缘状态评估方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

战略联盟内组织间跨边界知识共享研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员