Representation learning of source code is essential for applying machine learning to software engineering tasks. Learning code representations across different programming languages has been shown to be more effective than learning from single-language datasets, since the larger amount of training data in multi-language datasets improves the model's ability to extract language-agnostic information from source code. However, when training on multi-language datasets, existing multi-language models focus only on learning parameters shared among the different languages and overlook the language-specific information that is crucial for downstream tasks. To address this problem, we propose MetaTPTrans, a meta learning approach for multilingual code representation learning. MetaTPTrans generates different parameters for the feature extractor according to the specific programming language of the input source code snippet, enabling the model to learn both language-agnostic and language-specific information. Experimental results show that MetaTPTrans significantly improves the F1 score of state-of-the-art approaches by up to 2.40 percentage points on code summarization, a language-agnostic task, and the Top-1 (Top-5) prediction accuracy by up to 7.32 (13.15) percentage points on code completion, a language-specific task.
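The core idea of language-conditioned parameter generation can be illustrated with a minimal sketch. This is a hypothetical toy implementation, not the authors' code: a shared generator (a hypernetwork-style meta learner) maps a learnable per-language embedding to the weights of a small linear feature extractor, so the extractor's parameters differ per language while the generator itself is shared across languages.

```python
# Toy sketch (hypothetical, not the MetaTPTrans implementation) of
# language-conditioned parameter generation for a feature extractor.
import random

random.seed(0)

LANGS = ["python", "java", "go"]
EMB_DIM, IN_DIM, OUT_DIM = 4, 3, 2

# Shared learnable pieces: one embedding per language and one generator matrix.
lang_emb = {l: [random.uniform(-1, 1) for _ in range(EMB_DIM)] for l in LANGS}
# The generator maps an EMB_DIM embedding to IN_DIM * OUT_DIM extractor weights.
generator = [[random.uniform(-1, 1) for _ in range(EMB_DIM)]
             for _ in range(IN_DIM * OUT_DIM)]

def generate_weights(lang):
    """Produce extractor weights conditioned on the input language."""
    e = lang_emb[lang]
    flat = [sum(g * x for g, x in zip(row, e)) for row in generator]
    # Reshape the flat vector into an OUT_DIM x IN_DIM weight matrix.
    return [flat[i * IN_DIM:(i + 1) * IN_DIM] for i in range(OUT_DIM)]

def extract_features(x, lang):
    """Apply the language-specific linear extractor to input features x."""
    w = generate_weights(lang)
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]

x = [0.5, -1.0, 2.0]           # a stand-in for encoded source-code features
feats_py = extract_features(x, "python")
feats_java = extract_features(x, "java")
```

Because the same input passes through weights generated from different language embeddings, `feats_py` and `feats_java` differ, capturing language-specific information, while gradients flow into the single shared generator, capturing language-agnostic information.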