Coded distributed computing (CDC) can trade extra computing power to reduce the communication load for the MapReduce-type systems. The optimal computation-communication tradeoff has been well studied for homogeneous systems, and some results have also been obtained under the heterogeneous condition in recent studies. However, the previous works allow the file placement and Reduce function assignment free to design for the scheme. In this paper, we consider the general heterogeneous MapReduce system, where the file placement and Reduce function assignment are arbitrary but prefixed among all nodes (i.e., can not be designed by the scheme), and the storage and the computational capabilities for different nodes are not necessarily equal. We propose two {universal} CDC schemes and establish upper bounds of the optimal communication load. The first achievable scheme, namely One-Shot Coded Transmission (OSCT), encodes the intermediate values (IVs) into message blocks with different sizes to exploit the multicasting gain, and each message block can be decoded independently by the intended nodes. The second scheme, namely Few-Shot Coded Transmission (FSCT), splits IVs into smaller pieces and each node jointly decodes multiple message blocks to obtain its desired IVs. We prove that our OSCT and FSCT are optimal in many cases, and give sufficient conditions for the optimality of OSCT and FSCT, respectively.
翻译:代码化的分布计算(CDC)可以交换额外的计算能力,以减少马普图类系统的通信负荷。最佳计算-通信交换功能已经为同质系统进行了很好的研究,最近的研究还根据各种条件取得了一些结果。然而,以前的工程允许文件放置并减少功能分配,以便设计这个方案。在本文中,我们认为一般的混杂的马普图显示系统,其中文件放置和减少功能分配是任意的,但在所有节点(即无法由计划设计)之间预设的计算能力,不同节点的存储和计算能力不一定相等。我们提出了两种{普遍}CDC方案,并确定了最佳通信负荷的上限。第一个可实现的方案,即一流编码传输(OSCT),将中间值(IVs)编码为不同大小的信息块,以利用多投影收益,每个信息块可以独立地解码(即无法由计划设计),第二个方案,即微小编码化的编码化的编码化系统传输(FSFSC)的存储能力和计算能力不一定相等的多面块,我们所希望的四号和FSFSDFC系统将每一个都分为四号。