We propose the first Reversible Coherence Protocol (RCP), a new protocol designed from ground up that enables invisible speculative load. RCP takes a bold approach by including the speculative loads and merge/purge operation in the interface between processor and cache coherence, and allowing them to participate in the coherence protocol. It means, speculative load, ordinary load/store, and merge/purge can all affect the state of a given cache line. RCP is the first coherence protocol that enables the commit and squash of the speculative load among distributed cache components in a general memory hierarchy. RCP incurs an average slowdown of (3.0%, 7.4%) on (SPEC,PARSEC), compared to (26.5%,18.3%) in InvisiSpec and (3.2%,24.2%) in CleanupSpec. The coherence traffic overhead is on average 46%, compared to 40% and 27% of InvisiSpec and CleanupSpec, respectively. Even with higher traffic overhead (~ 46%), the performance overhead of RCP is lower than InvisiSpec and comparable to CleanupSpec. It reveals a key advantage of RCP: the coherence actions triggered by the merge and purge operations are not in the critical path of the execution and can be performed in the cache hierarchy concurrently with processor execution.
翻译:我们提出了第一个《腐蚀一致性议定书》,这是一个自下而上的新协议,它设计了隐蔽的投机负载。RCP采取了大胆的做法,将投机性负荷和合并/清洗行动纳入处理器和缓存一致性之间的界面,并允许它们参与一致性协议。这意味着,投机性负荷、普通负载/储存和合并/冲洗都可以影响特定缓冲线的状态。RCP是第一个在总体记忆级中使分布式缓冲部件之间承担和压碎投机性负荷的一致协议。RCP在(SPEC,PARSEC)(SPEC, )(SPEC, 7.4%)(SPEC, )上的平均减速幅度为3.5%、18.3%,而在Invispec和(3.2%,24.2%)中则允许它们参与一致性协议。 一致性交通管理平均为46%,而InvisiSpec和Clecurrupro(c)的40%和27%。即使交通管理费较高(~46%),但RCP的绩效管理费也低于Invispec,可与清洁Spec类相比。它展示了一个关键的执行过程,其关键的优势在于:在RCP执行过程中,其执行过程中,其关键的稳定性执行是同步行动。