Millions of sensors, mobile applications and machines now generate billions of events. Specialized many-core key-value stores (KVSs) can ingest and index these events at high rates (over 100 Mops/s on one machine) if events are generated on the same machine; however, to be practical and cost-effective they must ingest events over the network and scale across cloud resources elastically. We present Shadowfax, a new distributed KVS based on FASTER, that transparently spans DRAM, SSDs, and cloud blob storage while serving 130 Mops/s/VM over commodity Azure VMs using conventional Linux TCP. Beyond high single-VM performance, Shadowfax uses a unique approach to distributed reconfiguration that avoids any server-side key ownership checks or cross-core coordination both during normal operation and migration. Hence, Shadowfax can shift load in 17 s to improve system throughput by 10 Mops/s with little disruption. Compared to the state-of-the-art, it has 8x better throughput (than Seastar+memcached) and avoids costly I/O to move cold data during migration. On 12 machines, Shadowfax retains its high throughput to perform 930 Mops/s, which, to the best of our knowledge, is the highest reported throughput for a distributed KVS used for large-scale data ingestion and indexing.
翻译:现在,数百万个传感器、移动应用程序和机器正在产生数十亿个事件。如果在同一台机器上产生事件,专门的多核心关键价值仓库(KVS)能够以高速(超过100兆秒/在一台机器上)记录和索引这些事件;但是,为了做到实用和具有成本效益,它们必须在网络上和云层资源中以动态的方式记录事件,在云层资源中以快速的方式进行规模。我们展示了基于FASTER的新分布式KVS的Shadfax传真,它透明地覆盖DRAM、SSDSDs和云球球存储,同时利用传统的Linux TCP为130兆ps/s/VM 商品Azure VMs提供130兆/s/s/VMmmm。除了单一VM高性能外,Showapfax还采用独特的配置方法,在正常操作和迁移期间避免服务器-端关键所有权检查或交叉协调。因此,Shapfax可以将负载量转移17秒,将系统吞10兆/秒,但几乎没有干扰。相比,它有8x透视量,它有8x通过Star+emcax分布,并避免高压数据,在12号中进行最昂贵的IM/O数据。