We consider a distributed storage system with $n$ nodes, where a user can recover the stored file from any $k$ nodes, and study the problem of repairing $r$ partially failed nodes. We consider \textit{broadcast repair}, that is, $d$ surviving nodes transmit broadcast messages on an error-free wireless channel to the $r$ nodes being repaired, which are then used, together with the surviving data in the local memories of the failed nodes, to recover the lost content. First, we derive the trade-off between the storage capacity and the repair bandwidth for partial repair of multiple failed nodes, based on the cut-set bound for information flow graphs. It is shown that utilizing the broadcast nature of the wireless medium and the surviving contents at the partially failed nodes reduces the repair bandwidth per node significantly. Then, we list a set of invariant conditions that are sufficient for a functional repair code to be feasible. We further propose a scheme for functional repair of multiple failed nodes that satisfies the invariant conditions with high probability, and its extension to the repair of partial failures. The performance of the proposed scheme meets the cut-set bound on all the points on the trade-off curve for all admissible parameters when $k$ is divisible by $r$, while employing linear subpacketization, which is an important practical consideration in the design of distributed storage codes. Unlike random linear codes, which are conventionally used for functional repair of failed nodes, the proposed repair scheme has lower overhead, lower input-output cost, and lower computational complexity during repair.
翻译:我们考虑的是含有美元节点的分布式存储系统,用户可以从任何美元节点中回收存储的存储文件,并研究修复部分失败节点的问题。我们考虑的是“textit{broadcast maid mail ”,即“尚存的节点”将无错误无线频道上的广播信息传输给正在修复的节点,然后使用这些节点,加上未成功节点当地记忆中的剩余数据,以收回内容。首先,我们根据信息流图的切开式连接,将存储能力和部分失败节点部分修复的精密点的精密点的精密点进行交换。我们考虑的是,利用无偏差的节点的广播性质和部分失败内容大大降低了无遗漏的节点。然后,我们列出了一套足以使功能性修复代码变得可行的变化性条件。我们进一步提议了一个功能性修复多个不成功的节点的计划,该节点满足了变化状态的高概率,并将其扩展为部分失败部分失败的节点,以信息流流路点为约束。在使用常规设计规则时,所有正常交易规则的性规则的性都满足了标准。