We propose uBFT, the first State Machine Replication (SMR) system to achieve microsecond-scale latency in data centers, while using only $2f{+}1$ replicas to tolerate $f$ Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential as pure crashes appear to be a mere illusion with real-life systems reportedly failing in many unexpected ways. uBFT relies on a small non-tailored trusted computing base -- disaggregated memory -- and consumes a practically bounded amount of memory. uBFT is based on a novel abstraction called Consistent Tail Broadcast, which we use to prevent equivocation while bounding memory. We implement uBFT using RDMA-based disaggregated memory and obtain an end-to-end latency of as little as 10us. This is at least 50$\times$ faster than MinBFT , a state of the art $2f{+}1$ BFT SMR based on Intel's SGX. We use uBFT to replicate two KV-stores (Memcached and Redis), as well as a financial order matching engine (Liquibook). These applications have low latency (up to 20us) and become Byzantine tolerant with as little as 10us more. The price for uBFT is a small amount of reliable disaggregated memory (less than 1 MiB), which in our prototype consists of a small number of memory servers connected through RDMA and replicated for fault tolerance.
翻译:我们建议使用国家机器复制系统UBFT(UBFT),这是第一个在数据中心实现微观二级悬浮的国家机器复制系统(SMR),只有2美元+1美元的复制版,以容忍拜占庭失败。由UBFT提供的拜占庭断裂容忍(BFT)至关重要,因为纯碰撞似乎只是实际生活系统的幻觉,据报告,在很多意外的情况下,实际生活系统都失败了。UBFT依赖于一个小型的不具体操作的可信任计算基础 -- -- 分解记忆 -- 并消耗了实际约束的记忆量。UBFT是基于一个叫做 " 一致尾盘广播 " 的新抽象的抽象,我们用它来防止微缩的信号,同时将记忆捆绑起来。我们使用基于RDMA的分类记忆(BFT) 实施UBFT(BFT), 至少50美元比MinBFT快,这是基于 Intels SGX(S) 和 虚拟服务器(UBFT) 的微值,我们用UFFT(Minal-B) astial listal astial) ascial astial as astial as as astialdaldaldaldal) (Me) (Minealdaldal) as as 10 a morealdaldaltialtial) as) (Mustialdaltialtialtial) asyal) asyaltialdaltialdaldaldaldaldaldaldaldaldaldaldaldald as as as as as as 10 (Mialti) 和10 10 (Mialdaldaldaldal)。