Associative cache memory significantly influences processor performance and energy consumption. Because it occupies over half of the chip area, cache memory is highly susceptible to transient and permanent faults, posing reliability challenges. As the only hardware-managed memory module, the cache tag array is the most active and critical component, dominating both energy usage and error rate. Tag partitioning is a widely used technique to reduce tag-access energy and enhance reliability. It divides tag comparison into two phases: first comparing the k lower bits, and then activating only the matching tag entries to compare the remaining higher bits. The key design parameter is the selection of the tag-splitting point k, which determines how many reads are eliminated. However, prior studies have chosen k intuitively, randomly, or empirically, without justification. Even experimentally determined values are ad-hoc and do not generalize across cache configurations due to high sensitivity to architectural parameters. In this paper, we analytically show that choosing k too large or too small substantially reduces the effectiveness of tag partitioning. We then derive a formulation that determines the optimal splitting point based on cache configuration parameters. The formulation is convex, differentiable, and capable of precisely quantifying tag-partitioning efficiency for any k and configuration. To validate our model, we experimentally evaluate tag-partitioning efficiency and optimal k across a broad set of cache designs and demonstrate close agreement between analytical and experimental results. The proposed formulation enables designers and researchers to instantly compute the optimal tag-splitting point and accurately estimate tag-read reduction.
翻译:关联缓存存储器对处理器性能与能耗具有显著影响。由于占据超过一半的芯片面积,缓存存储器极易受到瞬态故障与永久性故障的影响,从而带来可靠性挑战。作为唯一由硬件直接管理的内存模块,缓存标签阵列是最活跃且最关键的组件,其能耗与错误率均占据主导地位。标签分区是一种广泛用于降低标签访问能耗并提升可靠性的技术。该技术将标签比较分为两个阶段:首先比较较低的k位,随后仅激活匹配的标签条目以比较剩余的高位。其关键设计参数在于标签分割点k的选择,该参数决定了可消除的读取操作数量。然而,先前研究对k的选择多基于直觉、随机或经验判断,缺乏理论依据。即使通过实验确定的数值也具有临时性,且由于对架构参数的高度敏感性,无法在不同缓存配置间泛化。本文通过理论分析证明,选择过大或过小的k值会显著降低标签分区的效能。我们进而推导出基于缓存配置参数确定最优分割点的数学表达式。该表达式具有凸性、可微性,能够精确量化任意k值与配置下标签分区的效率。为验证模型,我们通过实验评估了多种缓存设计下的标签分区效率与最优k值,结果表明理论分析与实验结果高度吻合。所提出的表达式使设计者与研究者能够即时计算最优标签分割点,并准确预估标签读取操作的减少量。