记忆处理现代初级启动器 (A Modern Primer on Processing in Memory)

Modern computing systems are overwhelmingly designed to move data to computation. This design choice goes directly against at least three key trends in computing that cause performance, scalability and energy bottlenecks: (1) data access is a key bottleneck as many important applications are increasingly data-intensive, and memory bandwidth and energy do not scale well, (2) energy consumption is a key limiter in almost all computing platforms, especially server and mobile systems, (3) data movement, especially off-chip to on-chip, is very expensive in terms of bandwidth, energy and latency, much more so than computation. These trends are especially severely-felt in the data-intensive server and energy-constrained mobile systems of today. At the same time, conventional memory technology is facing many technology scaling challenges in terms of reliability, energy, and performance. As a result, memory system architects are open to organizing memory in different ways and making it more intelligent, at the expense of higher cost. The emergence of 3D-stacked memory plus logic, the adoption of error correcting codes inside the latest DRAM chips, proliferation of different main memory standards and chips, specialized for different purposes (e.g., graphics, low-power, high bandwidth, low latency), and the necessity of designing new solutions to serious reliability and security issues, such as the RowHammer phenomenon, are an evidence of this trend. This chapter discusses recent research that aims to practically enable computation close to data, an approach we call processing-in-memory (PIM). PIM places computation mechanisms in or near where the data is stored (i.e., inside the memory chips, in the logic layer of 3D-stacked memory, or in the memory controllers), so that data movement between the computation units and memory is reduced or eliminated.

翻译：现代计算系统被绝大多数人设计成可以移动数据进行计算。这种设计选择直接与至少三种导致性能、可缩缩缩性和能源瓶颈的关键计算趋势相对立:(1) 数据访问是一个关键瓶颈,因为许多重要应用程序日益数据密集,记忆带宽和能源规模不高;(2) 能源消耗是几乎所有计算平台,特别是服务器和移动系统的关键限制;(3) 数据移动,特别是离芯到芯片的数据移动,在带宽、能量和延缓性方面,比计算更昂贵。这些趋势在当今数据密集服务器和能源紧凑的移动系统中尤为严重。与此同时,传统记忆技术正在面临许多技术在可靠性、能量和性能方面扩大的挑战。因此,记忆系统设计师可以以不同的方式组织记忆,特别是服务器和移动到芯片的离子,在最新的DRAM芯片中采用错误校正法。不同的主要记忆标准和芯片的激增,用于不同目的(e.g.c.r.r.r.r.r.r.r.d)内部的存储技术正在以不同的方式, 快速的存储数据流数据流流流流流流化数据流流化数据流化, 从而成为了。

相关内容

Performance

关注 3

Performance：International Symposium on Computer Performance Modeling, Measurements and Evaluation。 Explanation：计算机性能建模、测量和评估国际研讨会。 Publisher：ACM。 SIT：http://dblp.uni-trier.de/db/conf/performance/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日