A1 is an in-memory distributed database used by the Bing search engine to support complex queries over structured data. The key enablers for A1 are availability of cheap DRAM and high speed RDMA (Remote Direct Memory Access) networking in commodity hardware. A1 uses FaRM as its underlying storage layer and builds the graph abstraction and query engine on top. The combination of in-memory storage and RDMA access requires rethinking how data is allocated, organized and queried in a large distributed system. A single A1 cluster can store tens of billions of vertices and edges and support a throughput of 350+ million of vertex reads per second with end to end query latency in single digit milliseconds. In this paper we describe the A1 data model, RDMA optimized data structures and query execution.
翻译:A1 是Bing搜索引擎用于支持结构化数据复杂查询的模拟分布式数据库。 A1 的关键推进因素是商品硬件中廉价的 DRAM 和高速 RDMA(远程直接内存访问)网络的可用性。 A1 将FARM 作为其基本储存层,并在顶部建立图示抽象和查询引擎。 模拟存储和 RDMA 访问的结合要求重新思考在大型分布式系统中如何分配、组织和查询数据。 单个 A1 组可以存储数十亿个顶端和边缘,支持3.5亿+百万个顶端的垂直读取量,最后以一位数毫秒结束查询。 在本文中,我们描述了 A1 数据模型、 RDMA 优化的数据结构和查询执行。