With the increasing scale of communication networks, the likelihood of failures grows as well. Since these networks form a critical backbone of our digital society, it is important that they rely on robust routing algorithms which ensure connectivity despite such failures. While most modern communication networks feature robust routing mechanisms, these mechanisms are often fairly complex to design and verify, as they need to account for the effects of failures and rerouting on communication. This paper revisits the design of robust routing mechanisms, with the aim to avoid such complexity. In particular, we showcase simple and generic blackbox transformations that increase resilience of routing against independently distributed failures, which allows to simulate the routing scheme on the original network, even in the presence of non-benign node failures (henceforth called faults). This is attractive as the system specification and routing policy can simply be preserved. We present a scheme for constructing such a reinforced network, given an existing (synchronous) network and a routing scheme. We prove that this algorithm comes with small constant overheads, and only requires a minimal amount of additional node and edge resources. At the same time, it allows to tolerate a large number of independent random (node) faults, asymptotically almost surely. We complement our analytical results with simulations on different real-world topologies.
翻译:随着通信网络规模的扩大,失败的可能性也随之增加。由于这些网络是我们数字社会的关键主干,因此,重要的是,它们要依赖强有力的路由算法,确保连接性。尽管大多数现代通信网络都设有强有力的路由机制,但这些机制在设计和核查方面往往相当复杂,因为它们需要说明失败和通信改道的影响。本文回顾了强有力的路由机制的设计,目的是避免这种复杂性。特别是,我们展示了简单和通用黑盒转换,提高了路由独立分布的故障的复原力,从而可以模拟原始网络的路线安排,即使出现非恶意节点故障(因此称为断层),这也具有吸引力,因为系统规格和路由政策可以简单地保留。我们提出了一个建立这种强化的网络的计划,考虑到现有的(同步)网络和路由式计划。我们证明,这种算法是小规模的固定间接费用,只需要少量的额外节点和边缘资源。我们几乎可以独立地在原始网络上模拟结果。与此同时,它可以随机地容忍一个巨大的、真实的模型。