利用低资源方案编制语言为 " epedo-code " 生成转让知识</s> (Knowledge Transfer for Pseudo-code Generation from Low Resource Programming Language) - 专知论文

会员服务 ·

0

知识 (knowledge) · MoDELS · Automator · AIM · 语言模型化 ·

2023 年 3 月 16 日

Knowledge Transfer for Pseudo-code Generation from Low Resource Programming Language

翻译：利用低资源方案编制语言为 " epedo-code " 生成转让知识

Ankita Sontakke,Kanika Kalra,Manasi Patwardhan,Lovekesh Vig,Raveendra Kumar Medicherla,Ravindra Naik,Shrishti Pradhan

from arxiv, 11 pages, 1 figure, 5 tables

Generation of pseudo-code descriptions of legacy source code for software maintenance is a manually intensive task. Recent encoder-decoder language models have shown promise for automating pseudo-code generation for high resource programming languages such as C++, but are heavily reliant on the availability of a large code-pseudocode corpus. Soliciting such pseudocode annotations for codes written in legacy programming languages (PL) is a time consuming and costly affair requiring a thorough understanding of the source PL. In this paper, we focus on transferring the knowledge acquired by the code-to-pseudocode neural model trained on a high resource PL (C++) using parallel code-pseudocode data. We aim to transfer this knowledge to a legacy PL (C) with no PL-pseudocode parallel data for training. To achieve this, we utilize an Iterative Back Translation (IBT) approach with a novel test-cases based filtration strategy, to adapt the trained C++-to-pseudocode model to C-to-pseudocode model. We observe an improvement of 23.27% in the success rate of the generated C codes through back translation, over the successive IBT iteration, illustrating the efficacy of our approach.

翻译：生成用于软件维护的遗留源代码的伪代码描述是一个人工的艰巨任务。最近的编码- 编码解码语言模型已经显示出将诸如 C++ 等高资源编程语言的伪代码生成自动化的前景, 但是严重依赖大型代码假码软件库的可用性。为以遗留编程语言( PL) 写入的代码, 将这种伪代码描述引出伪代码是一个耗时且成本高昂的事情, 需要彻底理解源代码( PL ) 。在本文中, 我们侧重于将所培训的高资源( C++) 的代码- 假码神经系统模型获得的知识转让给使用平行代码/ 假码数据的高资源( C++ ) 。我们的目标是将这种知识转让给没有 PL- 伪码平行数据用于培训的遗留的 PL( C ) 。为了实现这一点, 我们用一种基于过滤策略的新颖的测试案例( IB) 翻译法, 将训练有素的C++- 至假码模式转换到 C- 。我们观察到在生成的C- 效能化方法的成功率方面提高了23. 。</s>

0

相关内容

知识 (knowledge)

知识 (knowledge)

通过学习、实践或探索所获得的认识、判断或技能。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

Th17细胞联合抗体介导的免疫反应在抗金黄色葡萄球菌感染中的保护作用

国家自然科学基金

0+阅读 · 2013年12月31日

水莱茵海默氏菌 (Rheinheimera aquimaris) 淬灭细菌群体感应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

嗜盐古菌CRISPR/Cas系统与基因组稳定性机制

国家自然科学基金

0+阅读 · 2012年12月31日

石墨烯材料的太赫兹响应

国家自然科学基金

0+阅读 · 2012年12月31日

马铃薯茎溃疡病原菌毒素的鉴定及其作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

鼻咽癌的EBNA3C-P53蛋白质复合体结构功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂超冷原子气体的低能激发

国家自然科学基金

0+阅读 · 2012年12月31日

基于谓词规划树的规划方法的研究

国家自然科学基金

1+阅读 · 2009年12月31日

纳米银广谱抗病毒作用及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

纳米钼配合物的仿生合成与抗癌抗肿瘤研究

国家自然科学基金

0+阅读 · 2008年12月31日

Toward Adversarial Training on Contextualized Language Representation

Arxiv

0+阅读 · 2023年5月8日

On Contrastive Learning of Semantic Similarity forCode to Code Search

Arxiv

0+阅读 · 2023年5月5日

Large Language Models for Code: Security Hardening and Adversarial Testing

Arxiv

0+阅读 · 2023年5月5日

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

Arxiv

0+阅读 · 2023年5月5日

LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Arxiv

0+阅读 · 2023年5月4日

Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs

Arxiv

15+阅读 · 2022年11月29日

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Arxiv

11+阅读 · 2019年10月30日

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods

Arxiv

88+阅读 · 2019年3月27日

Transferring Common-Sense Knowledge for Object Detection

Arxiv

12+阅读 · 2018年4月3日

Zero-Shot Transfer Learning for Event Extraction

Arxiv

10+阅读 · 2017年7月4日

VIP会员

文章信息

相关主题

知识 (knowledge)

语言模型化

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

美陆军五大转型方向

一种Agent自主性风险评估框架 | 最新文献

实时无人机指令处理：一种面向无人机系统的大语言模型方法

基于动态知识图谱的人工智能代理自主研究周期 | 文献

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Toward Adversarial Training on Contextualized Language Representation

Arxiv

0+阅读 · 2023年5月8日

On Contrastive Learning of Semantic Similarity forCode to Code Search

Arxiv

0+阅读 · 2023年5月5日

Large Language Models for Code: Security Hardening and Adversarial Testing

Arxiv

0+阅读 · 2023年5月5日

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

Arxiv

0+阅读 · 2023年5月5日

LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics

Arxiv

0+阅读 · 2023年5月4日

Lifelong Embedding Learning and Transfer for Growing Knowledge Graphs

Arxiv

15+阅读 · 2022年11月29日

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Arxiv

11+阅读 · 2019年10月30日

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods

Arxiv

88+阅读 · 2019年3月27日

Transferring Common-Sense Knowledge for Object Detection

Arxiv

12+阅读 · 2018年4月3日

Zero-Shot Transfer Learning for Event Extraction

Arxiv

10+阅读 · 2017年7月4日

相关基金

Th17细胞联合抗体介导的免疫反应在抗金黄色葡萄球菌感染中的保护作用

国家自然科学基金

0+阅读 · 2013年12月31日

水莱茵海默氏菌 (Rheinheimera aquimaris) 淬灭细菌群体感应的机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

嗜盐古菌CRISPR/Cas系统与基因组稳定性机制

国家自然科学基金

0+阅读 · 2012年12月31日

石墨烯材料的太赫兹响应

国家自然科学基金

0+阅读 · 2012年12月31日

马铃薯茎溃疡病原菌毒素的鉴定及其作用机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

鼻咽癌的EBNA3C-P53蛋白质复合体结构功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

掺杂超冷原子气体的低能激发

国家自然科学基金

0+阅读 · 2012年12月31日

基于谓词规划树的规划方法的研究

国家自然科学基金

1+阅读 · 2009年12月31日

纳米银广谱抗病毒作用及机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

纳米钼配合物的仿生合成与抗癌抗肿瘤研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员