Småprat:通过转让学习创造瑞典对话的自然语言 DialoGPT (Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning) - 专知论文

会员服务 ·

0

任务对话系统 · MoDELS · 迁移学习 · Performer · 学成 ·

2022 年 2 月 13 日

Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning

翻译：Småprat:通过转让学习创造瑞典对话的自然语言 DialoGPT

Tosin Adewumi,Rickard Brännvall,Nosheen Abid,Maryam Pahlavan,Sana Sabah Sabry,Foteini Liwicki,Marcus Liwicki

from arxiv, Presented at Northern Lights Deep Learning Conference (NLDL) 2022, Tromso, Norway

Building open-domain conversational systems (or chatbots) that produce convincing responses is a recognized challenge. Recent state-of-the-art (SoTA) transformer-based models for the generation of natural language dialogue have demonstrated impressive performance in simulating human-like, single-turn conversations in English. This work investigates, by an empirical study, the potential for transfer learning of such models to Swedish language. DialoGPT, an English language pre-trained model, is adapted by training on three different Swedish language conversational datasets obtained from publicly available sources. Perplexity score (an automated intrinsic language model metric) and surveys by human evaluation were used to assess the performances of the fine-tuned models, with results that indicate that the capacity for transfer learning can be exploited with considerable success. Human evaluators asked to score the simulated dialogue judged over 57% of the chatbot responses to be human-like for the model trained on the largest (Swedish) dataset. We provide the demos and model checkpoints of our English and Swedish chatbots on the HuggingFace platform for public use.

翻译：建立能产生令人信服的反应的开放对话系统(或聊天机)是一个公认的挑战。最近为产生自然语言对话而采用的最新最先进的基于艺术的变压器模型在模拟英语人样的单回合对话方面表现出了令人印象深刻的成绩。这项工作通过一项经验研究调查了将这种模型的学习能力转移至瑞典语的可能性。英语预先培训的DialoGPT模式,通过培训从公开来源获得的三种不同的瑞典语言对话数据集而加以调整。常识分(自动内在语言模型)和人类评估调查被用来评估精细调模型的性能,结果表明转移学习的能力可以相当成功地加以利用。人类评价者要求将模拟对话评分到超过57%的聊天机反应中,以便像人类一样在最大的(瑞典语)数据集上培训的模型。我们提供了我们英语和瑞典语聊天机在HuggingFace平台上供公众使用的演示和示范检查站。

0

相关内容

任务对话系统

任务对话系统

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Zero-Shot Learning相关资源大列表

Zero-Shot Learning相关资源大列表

专知

52+阅读 · 2019年1月1日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

高角度分辨率光场数据获取与图像生成方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于扰动波速散射理论的高分辨率SAR回波信号建模和应用

国家自然科学基金

0+阅读 · 2013年12月31日

舰船尾流场激光偏振探测机理研究

国家自然科学基金

1+阅读 · 2013年12月31日

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

基于眼球协同运动机理的海上动目标自动追踪控制

国家自然科学基金

0+阅读 · 2013年12月31日

高温高压水热体系中CO2传感器的研制

国家自然科学基金

0+阅读 · 2012年12月31日

高分辨率和全极化SAR冰川制图研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物启发自治水下机器人轨迹跟踪控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

复杂海洋环境下多机动目标跟踪机理及方法研究

国家自然科学基金

2+阅读 · 2011年12月31日

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年4月18日

Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning

Arxiv

1+阅读 · 2022年4月18日

Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation

Arxiv

0+阅读 · 2022年4月18日

Ìtàkúròso: Exploiting Cross-Lingual Transferability for Natural Language Generation of Dialogues in Low-Resource, African Languages

Arxiv

0+阅读 · 2022年4月17日

Just Fine-tune Twice: Selective Differential Privacy for Large Language Models

Arxiv

1+阅读 · 2022年4月15日

Evaluation Benchmarks for Spanish Sentence Representations

Arxiv

0+阅读 · 2022年4月15日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

Commonsense Reasoning for Natural Language Understanding: A Survey of Benchmarks, Resources, and Approaches

Arxiv

16+阅读 · 2019年4月2日

VIP会员

文章信息

相关主题

任务对话系统

相关VIP内容

对比学习简述

专知会员服务

90+阅读 · 2021年6月29日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025】面向时间序列基础模型的合成序列符号数据生成方法

军事通信市场七大趋势概述

【CMU博士论文】深度学习中泛化的量化、理解与改进

面向低光照图像增强的扩散模型

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Zero-Shot Learning相关资源大列表

Zero-Shot Learning相关资源大列表

专知

52+阅读 · 2019年1月1日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年4月18日

Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning

Arxiv

1+阅读 · 2022年4月18日

Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation

Arxiv

0+阅读 · 2022年4月18日

Ìtàkúròso: Exploiting Cross-Lingual Transferability for Natural Language Generation of Dialogues in Low-Resource, African Languages

Arxiv

0+阅读 · 2022年4月17日

Just Fine-tune Twice: Selective Differential Privacy for Large Language Models

Arxiv

1+阅读 · 2022年4月15日

Evaluation Benchmarks for Spanish Sentence Representations

Arxiv

0+阅读 · 2022年4月15日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Recent Advances in Deep Learning-based Dialogue Systems

Arxiv

18+阅读 · 2021年5月10日

Few-shot Natural Language Generation for Task-Oriented Dialog

Few-shot Natural Language Generation for Task-Oriented Dialog

Arxiv

30+阅读 · 2020年2月27日

Commonsense Reasoning for Natural Language Understanding: A Survey of Benchmarks, Resources, and Approaches

Arxiv

16+阅读 · 2019年4月2日

相关基金

高角度分辨率光场数据获取与图像生成方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于扰动波速散射理论的高分辨率SAR回波信号建模和应用

国家自然科学基金

0+阅读 · 2013年12月31日

舰船尾流场激光偏振探测机理研究

国家自然科学基金

1+阅读 · 2013年12月31日

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

基于眼球协同运动机理的海上动目标自动追踪控制

国家自然科学基金

0+阅读 · 2013年12月31日

高温高压水热体系中CO2传感器的研制

国家自然科学基金

0+阅读 · 2012年12月31日

高分辨率和全极化SAR冰川制图研究

国家自然科学基金

0+阅读 · 2012年12月31日

生物启发自治水下机器人轨迹跟踪控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

复杂海洋环境下多机动目标跟踪机理及方法研究

国家自然科学基金

2+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员