We present RDMAbox, a set of low level RDMA optimizations that provide better performance than previous approaches. The optimizations are packaged in easy-to-use kernel and user space libraries for applications and systems in data center. We demonstrate the flexibility and effectiveness of RDMAbox by implementing a kernel remote paging system and a user space file system using RDMAbox. RDMAbox employs two optimization techniques. First, we suggest RDMA request merging and chaining to further reduce the total number of I/O operations to the RDMA NIC. The I/O merge queue at the same time functions as a traffic regulator to enforce admission control and avoid overloading the NIC. Second, we propose Adaptive Polling to achieve higher efficiency of polling Work Completion than existing busy polling while maintaining the low CPU overhead of event trigger. Our implementation of a remote paging system with RDMAbox outperforms existing representative solutions with up to 4? throughput improvement and up to 83% decrease in average tail latency in bigdata workloads, and up to 83% reduction in completion time in machine learning workloads. Our implementation of a user space file system based on RDMAbox achieves up to 5.9? higher throughput over existing representative solutions.
翻译:我们提出RDMAbox,这是一套能提供比以往更好的业绩的低水平RDMA优化的RDMA系统。优化被包装在方便使用的内核和用户空间空间图书馆中,用于数据中心的应用和系统。我们通过使用RDMAbox实施一个内核远程传动系统和一个用户空间档案系统,来显示RDMAbox的灵活性和效力。RDMAbox采用两种优化技术。首先,我们建议RDMA请求合并和链条,以进一步减少I/O业务的总数,使其进一步减少与RDMA NIC的运行。I/O合并队列,同时作为交通调控器,以实施接收控制,避免给NIC造成超载。第二,我们建议调整投票,使投票工作完成效率高于现有繁忙的投票工作,同时保持低的CPU事件启动率。我们采用RDMAbox将现有的代表性解决方案升级到4个?通过改进投入,将大数据工作量的平均尾部拖拉减少83%。在完成机器学习工作量方面,将完成时间缩短至83%。我们实施了RMA系统,通过RDRDRDRDRDRDRDRDRDRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRRR