The adoption of very low latency persistent memory modules (PMMs) upends the long-established model of disaggregated file system access. Instead, by colocating computation and PMM storage, we can provide applications much higher I/O performance, sub-second application failover, and strong consistency. To demonstrate this, we built the Assise distributed file system, based on a persistent, replicated coherence protocol for managing a set of server-colocated PMMs as a fast, crash-recoverable cache between applications and slower disaggregated storage, such as SSDs. Unlike disaggregated file systems, Assise maximizes locality for all file IO by carrying out IO on colocated PMM whenever possible and minimizes coherence overhead by maintaining consistency at IO operation granularity, rather than at fixed block sizes. We compare Assise to Ceph/Bluestore, NFS, and Octopus on a cluster with Intel Optane DC PMMs and SSDs for common cloud applications and benchmarks, such as LevelDB, Postfix, and FileBench. We find that Assise improves write latency up to 22x, throughput up to 56x, fail-over time up to 103x, and scales up to 6x better than its counterparts, while providing stronger consistency semantics. Assise promises to beat the MinuteSort world record by 1.5x.
翻译:采用非常低的悬浮持久性内存模块(PMMs) 提升了长期建立的分解文件系统访问模式。 相反,通过对计算和 PMM 存储进行比对,我们可以提供更高I/O性能、次二应用程序故障和高度一致性的应用程序。 为了证明这一点,我们建立了一个Assise分布式文件系统,它基于一个持久、复制的统一协议,用于管理一套服务器合用的PMMM(PMs),作为快速的、可崩溃回收的缓存,在应用程序和SSDDs等较慢的分解存储器之间管理。不同于分解的文件系统,Assise尽可能通过在共享的 PMMM 上执行 IO, 尽可能通过保持IO 操作的一致性、 二级应用失败性,而不是固定的区块大小。我们把Assise 分布式文件系统建在Intel Optane DC PMMMs和SDSDSDS(SDS) 等通用云应用和基准上。 我们发现,Asusisionalfix, 改进了所有IODODBDO, 改进了LAx 上比x 10级的平比Ax 10级的平比Ax 更稳定性, 更稳定性为10到更稳定到22对等。