Parameter servers (PSs) facilitate the implementation of distributed training for large machine learning tasks. In this paper, we argue that existing PSs are inefficient for tasks that exhibit non-uniform parameter access; their performance may even fall behind that of single-node baselines. We identify two major sources of such non-uniform access: skew and sampling. Existing PSs are ill-suited for managing skew because they uniformly apply the same parameter management technique to all parameters. They are inefficient for sampling because the PS is oblivious to the associated randomized accesses and cannot exploit locality. To overcome these performance limitations, we introduce NuPS, a novel PS architecture that (i) integrates multiple management techniques and employs a suitable technique for each parameter and (ii) supports sampling directly via suitable sampling primitives and sampling schemes that allow for a controlled quality--efficiency trade-off. In our experimental study, NuPS outperformed existing PSs by up to one order of magnitude and provided up to linear scalability across multiple machine learning tasks.
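To make the two architectural ideas concrete, the following is a minimal, single-process Python sketch. It is not the NuPS API: all names (`NonUniformPS`, `sample_pull`, the locality bias constant) are hypothetical, and the real system is a distributed PS. The sketch only illustrates (i) choosing a management technique per parameter, here replication for hot (skewed) keys versus direct access for cold keys, and (ii) a sampling primitive through which the PS itself draws keys and can therefore prefer local ones, trading sampling quality for efficiency.

```python
import random

class NonUniformPS:
    """Toy PS that manages hot and cold parameters differently (hypothetical sketch)."""

    def __init__(self, params, hot_keys):
        self.store = dict(params)        # authoritative parameter values
        self.hot_keys = set(hot_keys)    # frequently accessed (skewed) keys
        self.replica_cache = {}          # local replicas of hot keys

    def pull(self, key):
        # Hot keys: serve from a (possibly stale) local replica -> cheap repeated reads.
        if key in self.hot_keys:
            if key not in self.replica_cache:
                self.replica_cache[key] = self.store[key]
            return self.replica_cache[key]
        # Cold keys: access the authoritative copy directly
        # (stands in for classic remote access / relocation).
        return self.store[key]

    def sample_pull(self, keys, weights, n, locality_bias=0.5):
        # Sampling primitive: the PS draws the keys itself, so it can prefer
        # keys that are already local (here: already replicated). The bias
        # parameter controls the quality--efficiency trade-off.
        local = [k for k in keys if k in self.replica_cache]
        drawn = []
        for _ in range(n):
            if local and random.random() < locality_bias:
                drawn.append(random.choice(local))          # cheap local draw
            else:
                drawn.append(random.choices(keys, weights=weights, k=1)[0])
        return {k: self.pull(k) for k in drawn}

# Usage: two hot keys are replicated; sampled pulls are biased toward them.
ps = NonUniformPS({f"w{i}": float(i) for i in range(10)}, hot_keys=["w0", "w1"])
ps.pull("w0")
batch = ps.sample_pull(list(ps.store), weights=[1.0] * 10, n=4)
```

The design point this sketch makes is the one the abstract argues: replication pays off only for the skewed hot keys, so the technique must be chosen per parameter, and letting the PS perform the sampling itself (rather than serving arbitrary randomized accesses it cannot predict) is what allows it to exploit locality at all.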