Topology-aware Generalization of Decentralized SGD - Zhuanzhi (专知) Papers


Generalization theory · SGD · HTTPS · Stochastic gradient descent · Correlation coefficient

February 4, 2023

Topology-aware Generalization of Decentralized SGD


Tongtian Zhu, Fengxiang He, Lan Zhang, Zhengyang Niu, Mingli Song, Dacheng Tao

From arXiv. Accepted for publication in the 39th International Conference on Machine Learning (ICML 2022).

This paper studies the algorithmic stability and generalizability of decentralized stochastic gradient descent (D-SGD). We prove that the consensus model learned by D-SGD is $\mathcal{O}(N^{-1} + m^{-1} + \lambda^2)$-stable in expectation in the non-convex non-smooth setting, where $N$ is the total sample size, $m$ is the number of workers, and $1-\lambda$ is the spectral gap that measures the connectivity of the communication topology. These results then deliver an $\mathcal{O}(N^{-(1+\alpha)/2} + m^{-(1+\alpha)/2} + \lambda^{1+\alpha} + \phi_{\mathcal{S}})$ in-average generalization bound, which is non-vacuous even when $\lambda$ is close to $1$, whereas existing literature on the projected version of D-SGD suggests a vacuous bound in that regime. Our theory indicates that the generalizability of D-SGD is positively correlated with the spectral gap, and can explain why consensus control in the initial training phase can ensure better generalization. Experiments with VGG-11 and ResNet-18 on CIFAR-10, CIFAR-100 and Tiny-ImageNet justify our theory. To the best of our knowledge, this is the first work on the topology-aware generalization of vanilla D-SGD. Code is available at https://github.com/Raiden-Zhu/Generalization-of-DSGD.
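To make the role of the spectral gap concrete, the following is a minimal sketch, not the authors' code, of how $\lambda$ and $1-\lambda$ could be computed for two communication topologies, together with one round of the commonly used vanilla D-SGD update. It assumes a symmetric, doubly stochastic mixing matrix `W` and takes $\lambda$ to be the second-largest eigenvalue magnitude of `W`. The function names, the uniform 1/3 ring weights, and the update form `W @ params - lr * grads` are illustrative assumptions rather than details taken from the abstract.

```python
import numpy as np


def ring_gossip_matrix(m: int) -> np.ndarray:
    """Symmetric doubly stochastic mixing matrix for a ring of m >= 3 workers.

    Assumption: each worker averages itself and its two neighbours
    with equal weight 1/3.
    """
    W = np.zeros((m, m))
    for i in range(m):
        for j in (i - 1, i, i + 1):
            W[i, j % m] = 1.0 / 3.0
    return W


def complete_gossip_matrix(m: int) -> np.ndarray:
    """Mixing matrix for a fully connected topology (exact averaging)."""
    return np.full((m, m), 1.0 / m)


def second_largest_eigenvalue_magnitude(W: np.ndarray) -> float:
    """Return lambda, the second-largest |eigenvalue| of a symmetric W."""
    magnitudes = np.sort(np.abs(np.linalg.eigvalsh(W)))
    return float(magnitudes[-2])


def dsgd_step(params: np.ndarray, grads: np.ndarray,
              W: np.ndarray, lr: float) -> np.ndarray:
    """One round of a vanilla D-SGD update (illustrative form).

    params, grads: arrays of shape (m, d), one row per worker.
    Each worker averages its neighbours' parameters via W, then takes
    a local gradient step.
    """
    return W @ params - lr * grads


# Compare the connectivity of the two topologies for 8 workers.
for name, W in [("ring", ring_gossip_matrix(8)),
                ("complete", complete_gossip_matrix(8))]:
    lam = second_largest_eigenvalue_magnitude(W)
    print(f"{name}: lambda = {lam:.4f}, spectral gap = {1.0 - lam:.4f}")
```

Under these assumptions the sparse ring has a $\lambda$ well above zero and hence a small spectral gap, while the fully connected topology has $\lambda$ equal to zero up to floating-point error. This is the direction in which the $\lambda$-dependent terms of the stated bounds grow or shrink.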

