MultPIM: Fast Stateful Multiplication for Processing-in-Memory


August 30, 2021


Orian Leitersdorf, Ronny Ronen, Shahar Kvatinsky

Processing-in-memory (PIM) seeks to eliminate data transfer between computation and memory by using devices that support both storage and logic. Stateful logic techniques such as IMPLY, MAGIC and FELIX can perform logic gates within memristive crossbar arrays with massive parallelism. Multiplication via stateful logic is an active field of research due to its wide implications. Recently, RIME has become the state-of-the-art algorithm for stateful single-row multiplication by using memristive partitions, reducing the latency of the previous state-of-the-art by 5.1x. In this paper, we begin by proposing novel partition-based computation techniques for broadcasting and shifting data. Then, we design an in-memory multiplication algorithm based on the carry-save add-shift (CSAS) technique. Finally, we detail specific logic optimizations to the algorithm that further reduce latency. These contributions constitute MultPIM, a multiplier that reduces state-of-the-art time complexity from quadratic to linear-log. For 32-bit numbers, MultPIM improves latency by an additional 3.8x over RIME, while also slightly reducing area overhead. Furthermore, we optimize MultPIM for full-precision matrix-vector multiplication and demonstrate a 22.0x latency improvement over FloatPIM matrix-vector multiplication.
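The carry-save add-shift (CSAS) technique named in the abstract can be illustrated in software. The following is a minimal Python sketch of the generic CSAS arithmetic only; it is not the MultPIM in-memory algorithm, and it does not model memristive partitions, broadcasting, or stateful gates. The names `full_adder` and `csas_multiply` are hypothetical and chosen for illustration. Each of the N units keeps one sum bit and one carry bit; in every step a unit adds its partial-product bit, the sum bit of the next-higher unit (the "shift"), and its own saved carry. In a parallel implementation all N units would update at once; here they are simulated one after another.

```python
def full_adder(x, y, z):
    """Return (sum, carry) of three input bits."""
    s = x ^ y ^ z
    c = (x & y) | (x & z) | (y & z)
    return s, c


def csas_multiply(a, b, n):
    """Multiply two unsigned n-bit integers with a carry-save add-shift scheme.

    Illustrative sketch: carries are saved in place instead of being
    propagated, and the running sum is shifted right by one unit per step.
    """
    if not (0 <= a < 2 ** n and 0 <= b < 2 ** n):
        raise ValueError("operands must fit in n bits")

    a_bits = [(a >> i) & 1 for i in range(n)]
    s = [0] * n  # saved sum bits, one per unit
    c = [0] * n  # saved carry bits, one per unit
    product = 0

    # n steps consume the multiplier bits; n more steps (with zero partial
    # products) flush the saved carries into the upper half of the product.
    for k in range(2 * n):
        b_k = (b >> k) & 1 if k < n else 0
        new_s = [0] * n
        new_c = [0] * n
        for i in range(n):
            shifted_sum = s[i + 1] if i + 1 < n else 0
            new_s[i], new_c[i] = full_adder(a_bits[i] & b_k, shifted_sum, c[i])
        s, c = new_s, new_c
        product |= s[0] << k  # unit 0 emits product bit k
    return product
```

For example, `csas_multiply(3, 3, 2)` returns 9. The sketch uses 2N sequential steps for N-bit operands; the latency figures quoted in the abstract depend on how each step is realized in memory, which this code does not attempt to reproduce.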

