We observe that emerging artificial intelligence, high-performance computing, and storage workloads pose new challenges for large-scale datacenter networking. RDMA over Converged Ethernet (RoCE) was an attempt to adopt modern Remote Direct Memory Access (RDMA) features into existing Ethernet installations. Now, a decade later, we revisit RoCE's design points and conclude that several of its shortcomings must be addressed to fulfill the demands of hyperscale datacenters. We predict that both the datacenter and high-performance computing markets will converge and adopt modernized Ethernet-based high-performance networking solutions that will replace TCP and RoCE within a decade.
翻译:我们观察到,新兴的人工智能、高性能计算和存储工作负载给大规模数据中心网络带来了新的挑战。以太网上的基于远程直接内存访问(RDMA)的RoCE是将现代RDMA特性引入现有以太网部署的一种尝试。现在,在十年后,我们重新审视RoCE的设计要点,并得出结论,必须解决它的几个缺点,以满足超大规模数据中心的需求。我们预测,数据中心和高性能计算市场将会融合,并采用现代化的Ethernet高性能网络解决方案,将在十年内取代TCP和RoCE。