Gradient clock synchronization (GCS) algorithms minimize the worst-case clock offset between the nodes in a distributed network of diameter $D$ and size $n$. They achieve optimal offsets of $\Theta(\log D)$ locally, i.e. between adjacent nodes as shown by Lenzen et al., and $\Theta(D)$ globally as shown by Biaz and Welch. As demonstrated in the work of Bund et al., this is a highly promising approach for improved clocking schemes for large-scale synchronous Systems-on-Chip (SoC). Unfortunately, in large systems, faults hinder their practical use. State of the art fault-tolerant, as presented by Bund et al., has a drawback that is fatal in this setting: It relies on node and edge replication. For $f=1$, this translates to at least $16$-fold edge replication and high degree nodes, far from the optimum of $2f+1=3$ for tolerating up to $f$ faulty neighbors. In this work, we present a self-stabilizing GCS algorithm for a grid-like directed graph with optimal node in- and out-degrees of $3$ that tolerates $1$ faulty in-neighbor. If nodes fail with independent probability $p\in o(n^{-1/2})$, it achieves asymptotically optimal local skew of $\Theta(\log D)$ with probability $1-o(1)$; this holds under general worst-case assumptions on link delay and clock speed variations, provided they change slowly relative to the speed of the system. The failure probability is the largest possible ensuring that with probabity $1-o(1)$ for each node at most one in-neighbor fails. As modern hardware is clocked at gigahertz speeds and the algorithm can simultaneously sustain a constant number of arbitrary changes due to faults in each clock cycle, this results in sufficient robustness to dramatically increase the size of reliable synchronously clocked SoCs.
翻译:重度时钟同步( GCS) 算法将最坏的时钟折合在一个直径为$D$和大小为$n$的分布式网络中的节点之间的最坏的时钟冲抵最小化。 不幸的是, 在大型系统中, 错误会阻碍这些节点的实际使用。 Bund 和 al. 所展示的艺术过错度状态在这种环境下是致命的: 它依赖于节点和边缘复制。 $f=1, 这相当于至少16美元的边缘复制和高度节点。 在Bund 等人的工作中, 这是一种非常有希望的方法, 改进对大规模同步系统( SoC) 最大同步的节时钟计办法。 不幸的是, 在大型系统中, 错误会阻碍这些节点的实际使用。 在普通和他人看来, 艺术过错的偏差状态会维持在每一个节点上的节点。 $ 美元 。 对于每个节点来说, 节点的节点和Q- sal- sal- dirdal 的变速速度会比 。