We continue our study of regenerating codes in distributed storage systems where connections between the nodes are constrained by a graph. In this problem, the failed node downloads the information stored at a subset of vertices of the graph for the purpose of recovering the lost data. This information is moved across the network, and the cost of node repair is determined by the graphical distance from the helper nodes to the failed node. This problem was formulated in our recent work (IEEE IT Transactions, May 2022) where we showed that processing of the information at the intermediate nodes can yield savings in repair bandwidth over the direct forwarding of the data. While the previous paper was limited to the MSR case, here we extend our study to the case of general regenerating codes. We derive a lower bound on the repair bandwidth and formulate repair procedures with intermediate processing for several families of regenerating codes, with an emphasis on the recent constructions from multilinear algebra. We also consider the task of data retrieval for codes on graphs, deriving a lower bound on the communication bandwidth and showing that it can be attained at the MBR point of the storage-bandwidth tradeoff curve.
翻译:我们继续研究在分布式储存系统中重新生成代码,因为节点之间的连接受到图表的限制。 在这个问题中,失败的节点下载了在图表的一个脊椎子上储存的信息,以收回丢失的数据。这些信息在整个网络中移动,节点修复的费用由从帮助节点到失败节点的图形距离决定。这个问题在我们最近的工作(IEEEE IT Transaction, 2022年5月)中提出,我们在中间节点处理信息可以节省数据直接传输的频带的修理费用。上一份节点仅限于MSR案例,但我们在此将我们的研究扩大到一般再生码案例。我们对修理带宽进行较低的约束,并为若干再生代码家庭制定中间处理程序,重点是最近从多线性阿尔格布拉的构造。我们还考虑对图表代码进行数据检索的任务,在通信带宽上得出较低的约束,并表明可以在存储带贸易曲线的MBR点实现。