利用云团分布式计算 (Distributed Computing With the Cloud)

We investigate the effect of omnipresent cloud storage on distributed computing. We specify a network model with links of prescribed bandwidth that connect standard processing nodes, and, in addition, passive storage nodes. Each passive node represents a cloud storage system, such as Dropbox, Google Drive etc. We study a few tasks in this model, assuming a single cloud node connected to all other nodes, which are connected to each other arbitrarily. We give implementations for basic tasks of collaboratively writing to and reading from the cloud, and for more advanced applications such as matrix multiplication and federated learning. Our results show that utilizing node-cloud links as well as node-node links can considerably speed up computations, compared to the case where processors communicate either only through the cloud or only through the network links. We provide results for general directed graphs, and for graphs with ``fat'' links between processing nodes. For the general case, we provide optimal algorithms for uploading and downloading files using flow techniques. We use these primitives to derive algorithms for \emph{combining}, where every processor node has an input value and the task is to compute a combined value under some given associative operator. In the case of fat links, we assume that links between processors are bidirectional and have high bandwidth, and we give near-optimal algorithms for any commutative combining operator (such as vector addition). For the task of matrix multiplication (or other non-commutative combining operators), where the inputs are ordered, we present sharp results in the simple ``wheel'' network, where procesing nodes are arranged in a ring, and are all connected to a single cloud node.

翻译：我们调查分布式计算中天文云存储的影响。我们指定了连接标准处理节点和被动存储节点的指定带宽链接的网络模型。每个被动节点代表云存储系统, 如 Drobox、 Google驱动器等。我们研究模型中的一些任务, 假设与其他所有节点连接的单一云节点, 这些节点是任意连接的。对于一般案例, 我们为从云层上和从云层上进行协作写作和读取的基本任务, 以及更先进的应用程序, 如矩阵倍增和粘合学习。我们的结果显示, 使用节点和节点连接连接的节点链接以及节点和节点连接可以大大加快计算速度, 相比之下, 处理者只能通过云端或仅通过网络链接进行交流。我们为通用的图表提供结果, 与处理节点之间连接的图表。对于一般案例, 我们为上传和从云中下载的文件提供了最优化的算法, 我们使用这些原始算法来为某些 emph{combing 以及节点连接起来。