BabyBear: 昂贵语言模型的廉价推论分解 (BabyBear: Cheap inference triage for expensive language models) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · 推断 · 级联 · 模型评估 ·

2022 年 5 月 24 日

BabyBear: Cheap inference triage for expensive language models

翻译：BabyBear: 昂贵语言模型的廉价推论分解

Leila Khalili,Yao You,John Bohannon

from arxiv, 7 pages, 6 figures

Transformer language models provide superior accuracy over previous models but they are computationally and environmentally expensive. Borrowing the concept of model cascading from computer vision, we introduce BabyBear, a framework for cascading models for natural language processing (NLP) tasks to minimize cost. The core strategy is inference triage, exiting early when the least expensive model in the cascade achieves a sufficiently high-confidence prediction. We test BabyBear on several open source data sets related to document classification and entity recognition. We find that for common NLP tasks a high proportion of the inference load can be accomplished with cheap, fast models that have learned by observing a deep learning model. This allows us to reduce the compute cost of large-scale classification jobs by more than 50% while retaining overall accuracy. For named entity recognition, we save 33% of the deep learning compute while maintaining an F1 score higher than 95% on the CoNLL benchmark.

翻译：变换语言模型比先前的模型提供更精准的精度, 但这些模型在计算上和环境上都非常昂贵。借用计算机视觉中的模型级联概念, 我们引入BabyBear, 这是自然语言处理任务( NLP) 的级联模型框架, 以最大限度地降低成本。核心策略是推论分级, 当级联中最昂贵的模型达到足够高的可信度预测值时, 提前退出。我们测试BabyBear, 测试与文件分类和实体识别有关的多个开放源数据集。我们发现, 对于通用的 NLP 任务, 高比例的推论负荷可以用通过观察深层学习模型所学的廉价快速模型完成。这使我们能够将大型分类任务的计算成本降低50%以上, 同时保持总体准确性。对于名称实体的识别, 我们节省了33%的深度学习计算, 同时在CONLL基准上保持高于95%的F1分数。

0

相关内容

语言模型化

语言模型化

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

GJ的Ca2+传递引起钙稳态失衡诱导内质网应激在肝移植术后急性肾损伤中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

ZEB2基因3’UTR区SNPs与非小细胞肺癌放射敏感性的相关性及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

关于Lp多调和边值问题的若干研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

正交双波长双脉冲LA-LIBS技术及其在原位元素显微分析中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

Runge-Kutta间断Galerkin方法的各向异性自适应方法及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

颈内静脉和椎静脉结构和血流动力学异常与颅内静脉窦血栓形成相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

临近空间高超声速目标宽带电磁特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

茶多酚抑制HMGB1介导的自噬增强膀胱癌化疗敏感性的作用及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

Embedding Recycling for Language Models

Embedding Recycling for Language Models

Arxiv

0+阅读 · 2022年7月11日

Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference

Arxiv

0+阅读 · 2022年7月11日

Enabling Binary Neural Network Training on the Edge

Arxiv

0+阅读 · 2022年7月10日

Myers-Briggs personality classification from social media text using pre-trained language models

Arxiv

0+阅读 · 2022年7月10日

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action

Arxiv

0+阅读 · 2022年7月10日

G2L: A Geometric Approach for Generating Pseudo-labels that Improve Transfer Learning

Arxiv

0+阅读 · 2022年7月7日

K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

Arxiv

15+阅读 · 2021年9月22日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

Distance-based Self-Attention Network for Natural Language Inference

Arxiv

10+阅读 · 2017年12月6日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

77+阅读 · 2022年3月15日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

108+阅读 · 2020年5月1日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

中文版 | 人工智能与未来战争：算法战的崛起

《建模与仿真（M&S）导论》32页最新报告

《美陆军多域作战训练范围指南（适用于连级至旅级指挥官）》最新84页报告

《超视距空战中的仿真与机器学习技术综述》最新长综述

相关资讯

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

相关论文

Embedding Recycling for Language Models

Embedding Recycling for Language Models

Arxiv

0+阅读 · 2022年7月11日

Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference

Arxiv

0+阅读 · 2022年7月11日

Enabling Binary Neural Network Training on the Edge

Arxiv

0+阅读 · 2022年7月10日

Myers-Briggs personality classification from social media text using pre-trained language models

Arxiv

0+阅读 · 2022年7月10日

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action

Arxiv

0+阅读 · 2022年7月10日

G2L: A Geometric Approach for Generating Pseudo-labels that Improve Transfer Learning

Arxiv

0+阅读 · 2022年7月7日

K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

Arxiv

15+阅读 · 2021年9月22日

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Arxiv

15+阅读 · 2018年10月11日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

Distance-based Self-Attention Network for Natural Language Inference

Arxiv

10+阅读 · 2017年12月6日

相关基金

GJ的Ca2+传递引起钙稳态失衡诱导内质网应激在肝移植术后急性肾损伤中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

ZEB2基因3’UTR区SNPs与非小细胞肺癌放射敏感性的相关性及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

关于Lp多调和边值问题的若干研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

正交双波长双脉冲LA-LIBS技术及其在原位元素显微分析中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

Runge-Kutta间断Galerkin方法的各向异性自适应方法及其应用

国家自然科学基金

0+阅读 · 2012年12月31日

颈内静脉和椎静脉结构和血流动力学异常与颅内静脉窦血栓形成相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

临近空间高超声速目标宽带电磁特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

茶多酚抑制HMGB1介导的自噬增强膀胱癌化疗敏感性的作用及其机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

拋物奇异积分算子有界性及其应用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员