Deep learning recommendation models (DLRMs) have been widely applied in Internet companies. The embedding tables of DLRMs are too large to fit entirely in GPU memory. We propose a GPU-based software cache approach to dynamically manage the embedding table across CPU and GPU memory by leveraging the ID frequency statistics of the target dataset. Our proposed software cache enables efficient training of entire DLRMs on GPU with synchronized parameter updates. It also scales to multiple GPUs in combination with widely used hybrid-parallel training approaches. Evaluation of our prototype system shows that keeping only 1.5% of the embedding parameters in GPU memory is sufficient to obtain a decent end-to-end training speed.
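The core idea can be illustrated with a minimal sketch: a small "GPU-resident" buffer holds the hottest embedding rows, fetches missing rows from the full table in CPU memory, and evicts the least-frequently-used row when full. The class and variable names below are hypothetical, and plain NumPy arrays stand in for CPU- and GPU-side memory; this is an assumption-laden simplification, not the paper's actual implementation.

```python
import numpy as np

EMB_DIM = 8
NUM_IDS = 1000
CACHE_CAPACITY = 64  # number of rows kept in (simulated) GPU memory

# The full embedding table lives in (simulated) CPU memory.
cpu_table = np.random.rand(NUM_IDS, EMB_DIM).astype(np.float32)

class FreqAwareCache:
    """Keeps frequently accessed embedding rows in a small buffer
    standing in for GPU memory; evicts the least-frequently-used
    cached row (writing it back to the CPU table) when full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.gpu_rows = {}  # id -> row copy (stands in for GPU memory)
        self.freq = {}      # id -> access count (frequency statistics)

    def lookup(self, idx):
        self.freq[idx] = self.freq.get(idx, 0) + 1
        if idx in self.gpu_rows:
            return self.gpu_rows[idx]  # cache hit: row already on "GPU"
        if len(self.gpu_rows) >= self.capacity:
            # Evict the least-frequently-used cached id and
            # write its (possibly updated) row back to CPU memory.
            victim = min(self.gpu_rows, key=lambda i: self.freq[i])
            cpu_table[victim] = self.gpu_rows.pop(victim)
        # Cache miss: fetch the row from CPU memory into the cache.
        self.gpu_rows[idx] = cpu_table[idx].copy()
        return self.gpu_rows[idx]
```

In a real system the hot rows would live in an actual GPU tensor and transfers would be batched per training iteration, but the hit/miss/evict logic follows the same pattern.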