Partitioning large matrices is an important problem in distributed linear algebra computing (used in ML among others). Briefly, our goal is to perform a sequence of matrix algebra operations in a distributed manner (whenever possible) on these large matrices. However, not all partitioning schemes work well with different matrix algebra operations and their implementations (algorithms). This is a type of data tiling problem. In this work we consider a theoretical model for a version of the matrix tiling problem in the setting of hypergraph labeling. We prove some hardness results and give a theoretical characterization of its complexity on random instances. Additionally we develop a greedy algorithm and experimentally show its efficacy.
翻译:在分布式线性代数计算中,大矩阵分割是一个重要问题(用于ML等)。简而言之,我们的目标是在这些大矩阵上以分布式方式(尽可能)进行矩阵代数操作的顺序。然而,并非所有的分隔方案都与不同的矩阵代数操作及其实施(等分数)运作良好。这是一个数据拼拼法问题的类型。在这项工作中,我们考虑在高光标标签设置中采用矩阵打字问题版本的理论模型。我们证明了一些硬性结果,并在随机实例中从理论上描述其复杂性。此外,我们开发了一种贪婪的算法并实验性地展示了其有效性。