Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions

May 24, 2023

Jiahuan Li, Hao Zhou, Shujian Huang, Shanbo Chen, Jiajun Chen

Large-scale pretrained language models (LLMs), such as ChatGPT and GPT4, have shown strong multilingual translation abilities without being explicitly trained on parallel corpora. An interesting question is how LLMs acquire the ability to carry out translation instructions for different languages. In this paper, we present a detailed analysis by finetuning a multilingual pretrained language model, XGLM-7B, to perform multilingual translation following given instructions. First, we show that multilingual LLMs have stronger translation abilities than previously demonstrated. For a given language pair, performance depends on both the language families involved and the amount of data used in the pretraining phase. Second, we find that LLMs' ability to carry out translation instructions relies on their understanding of the translation instruction and on the alignment among different languages. With proper enhancement, LLMs can perform the translation task well even for language pairs unseen during the instruction tuning phase.
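
As a rough illustration of what finetuning data with translation instructions might look like, the sketch below formats parallel sentences as instruction/response pairs and separates language pairs seen during instruction tuning from unseen ones. The template wording and the helper names `build_example` and `split_pairs` are assumptions made for illustration; the abstract does not specify the prompt format or data pipeline used in the paper.

```python
from itertools import permutations

# Hypothetical instruction template (an assumption; the abstract does not
# give the exact wording used for the translation instructions).
TEMPLATE = "Translate the following sentence from {src} to {tgt}.\n{src}: {text}\n{tgt}:"


def build_example(src_lang, tgt_lang, src_text, tgt_text):
    """Turn one parallel sentence pair into a prompt/completion record."""
    prompt = TEMPLATE.format(src=src_lang, tgt=tgt_lang, text=src_text)
    # The leading space separates the completion from the trailing colon.
    return {"prompt": prompt, "completion": " " + tgt_text}


def split_pairs(languages, seen_pairs):
    """Split all directed language pairs into those used for instruction
    tuning (seen) and those held out for evaluation (unseen)."""
    all_pairs = set(permutations(languages, 2))
    seen = set(seen_pairs)
    unseen = all_pairs - seen
    return sorted(seen), sorted(unseen)


example = build_example("English", "German", "Good morning.", "Guten Morgen.")
seen, unseen = split_pairs(
    ["English", "German", "French"],
    [("English", "German"), ("German", "English")],
)
print(example["prompt"])
print(unseen)
```

Holding out directed pairs in this way is one simple way to probe the abstract's claim about language pairs unseen during instruction tuning; the actual experimental split in the paper may differ.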
