We propose the first Reversible Coherence Protocol (RCP), a new protocol designed from ground up that enables invisible speculative load. RCP takes a bold approach by including the speculative loads and merge/purge operation in the interface between processor and cache coherence, and allowing them to participate in the coherence protocol. It means, speculative load, ordinary load/store, and merge/purge can all affect the state of a given cache line. RCP is the first coherence protocol that enables the commit and squash of the speculative load among distributed cache components in a general memory hierarchy. RCP incurs an average slowdown of (3.0%,8.3%,7.4%) on (SPEC2006,SPEC2017,PARSEC), which is lower compared to (26.5%,12%,18.3%) in InvisiSpec and (3.2%,9.4%,24.2%) in CleanupSpec. The coherence traffic overhead is on average 46%, compared to 40% and 27% of InvisiSpec and CleanupSpec, respectively. Even with higher traffic overhead (~46%), the performance overhead of RCP is lower than InvisiSpec and comparable to CleanupSpec. It reveals a key advantage of RCP: the coherence actions triggered by the merge and purge operations are not in the critical path of the execution and can be performed in the cache hierarchy concurrently with processor execution
翻译:我们提出了第一个《腐蚀一致性议定书》(RCP),这是一个从地面上设计的新协议,可以带来隐形投机负负负。 RCP采取了大胆的做法,将投机性负荷和合并/清洗行动纳入处理器和缓存一致性之间的界面,并允许它们参与一致性协议。这意味着投机性负荷、普通负载/储存和合并/冲洗都可以影响特定缓冲线的状态。 RCP是第一个允许在一般记忆级中对分布式缓冲组件中进行和压缩投机性负载的统一协议。 RCP平均减速(3.0%,8.3%,7.4%)在(SPEC2006,SPEC2017,PARSEC)上(SPEC)下调(3.5%,12%,18.3%),并允许它们参与一致性协议。这意味着,投机性负荷、普通负载/储存和合并/合并/冲洗,都可以影响特定缓冲线条的状态。 一致性交通管理费平均为46%,而Inspec和ClectionS分别是40%和27%。即使交通管理费用较高(~46%),但REPEC执行的绩效管理比执行速度比关键行动要低。