JSC NICEVT has developed the Angara high-speed interconnect with 4D torus topology. The Angara interconnect router implements deterministic routing based on the bubble flow control, a direction order routing (DOR) and direction bits rules. The router chip also supports non standard First Step / Last Step for bypassing failed nodes and links, these steps can violate the DOR rule. In the previous work we have proposed an algorithm for generation and analysis of routing tables that guarantees no deadlocks in the Angara interconnect. It is based on a breadth-first search algorithm in a graph and it practically does not take into consideration communication channel load. Also we have never evaluated the influence of routing table generation algorithm on the performance of a real-world Angara based cluster. In this paper we present a routing graph notation that provides a possibility to build routes in the torus topology of the Angara interconnect. We propose a deadlock-free routing algorithm based on a fast single-source shortest path algorithm for the deterministic Angara routing with a single virtual channel. We evaluated the considered routing algorithms on a 32-node Desmos cluster system and benchmarked the proposed algorithm performance improvement of 11.1% for the Alltoall communication pattern and of more than 5% for the FT and IS application kernels.
翻译:CTJSC NIDEVT 开发了安加拉高速连接与 4D 星象学 4D 星象学的连接。 安加拉互联路由器基于气泡流控制、 方向秩序路由( DOR) 和方向位规则,执行确定路由路由规则。路由器芯片还支持了绕过失败节点和链接的不标准第一/最后一步,这些步骤可能违反 DOR 规则。在先前的工作中,我们提出了一种生成和分析路由表的算法,保证安加拉互连不陷入僵局。它基于一个图中的宽度第一搜索算法,实际上它没有考虑到通信频道负荷。我们从未评估过路由制表生成算法对基于真实世界安加拉集群的业绩的影响。在本文中,我们提出了一个路由路由图标记符标记,提供了在安加拉互连的托斯顶学中建立路线的可能性。我们提出一种无僵局路由算算法,其基础是快速的、最短的路径算法,用于确定性安加拉诺规则的安加拉路段,实际上没有考虑到通信频道负荷的通信负荷系统。我们用一个单一的SDIS路由系统,用于所有SDIS标变变变的系统。