Pre-trained code representation models such as CodeBERT have demonstrated superior performance in a variety of software engineering tasks, yet they are computationally heavy: their complexity grows quadratically with the length of the input sequence. Our empirical analysis of CodeBERT's attention reveals that it pays more attention to certain types of tokens and statements, such as keywords and data-relevant statements. Based on these findings, we propose DietCodeBERT, which aims at lightweight leverage of large pre-trained models for source code. DietCodeBERT simplifies the input program of CodeBERT with three strategies, namely word dropout, frequency filtering, and an attention-based strategy that selects the statements and tokens receiving the most attention weights during pre-training. It thereby achieves a substantial reduction in computational cost without hampering model performance. Experimental results on two downstream tasks show that DietCodeBERT provides results comparable to CodeBERT with 40% less computational cost in fine-tuning and testing.
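As a rough illustration of the attention-based selection strategy described above, the following Python sketch (not the authors' implementation; the pruning budget `keep_ratio` and the layer/head averaging scheme are illustrative assumptions) scores each token of an input snippet by the attention it receives from pre-trained CodeBERT and keeps only the highest-scoring tokens.

```python
# Minimal sketch: attention-based token pruning with a pre-trained CodeBERT.
# This is an illustrative approximation, not the DietCodeBERT implementation.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base", output_attentions=True)
model.eval()

def prune_by_attention(code: str, keep_ratio: float = 0.6):
    """Keep the tokens that receive the most attention from the pre-trained model."""
    inputs = tokenizer(code, return_tensors="pt", truncation=True)
    with torch.no_grad():
        outputs = model(**inputs)
    # outputs.attentions: tuple of (batch, heads, seq, seq) per layer.
    # Average over layers and heads, then sum over query positions to obtain
    # how much attention each token (key position) receives in total.
    att = torch.stack(outputs.attentions).mean(dim=(0, 2))  # (batch, seq, seq)
    received = att.sum(dim=1).squeeze(0)                     # (seq,)
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    k = max(1, int(keep_ratio * len(tokens)))
    keep = sorted(received.topk(k).indices.tolist())         # preserve original order
    return [tokens[i] for i in keep]

print(prune_by_attention("def add(a, b):\n    return a + b"))
```

In practice the pruned token sequence, rather than the full program, would then be fed to the model during fine-tuning and testing, which is where the computational savings come from.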