In recent years, the fields of natural language processing (NLP) and information retrieval (IR) have made tremendous progress thanks to deep learning models like Recurrent Neural Networks (RNNs), Gated Recurrent Units (GRUs), and Long Short-Term Memory (LSTM) networks, as well as Transformer [120] based models like Bidirectional Encoder Representations from Transformers (BERT) [24], Generative Pre-training Transformer (GPT-2) [94], Multi-task Deep Neural Network (MT-DNN) [73], Extra-Long Network (XLNet) [134], Text-to-Text Transfer Transformer (T5) [95], T-NLG [98], and GShard [63]. But these models are humongous in size. On the other hand, real-world applications demand small model size, low response times, and low computational power. In this survey, we discuss six different types of methods (Pruning, Quantization, Knowledge Distillation, Parameter Sharing, Tensor Decomposition, and Sub-quadratic Transformer based methods) for compressing such models to enable their deployment in real industry NLP projects. Given the critical need of building applications with efficient and small models, and the large amount of recently published work in this area, we believe that this survey organizes the plethora of work done by the 'deep learning for NLP' community in the past few years and presents it as a coherent story.