理论分析非Convex储存分散化优化的原始-二元计算法的理论分析 (Theoretical Analysis of Primal-Dual Algorithm for Non-Convex Stochastic Decentralized Optimization) - 专知论文

会员服务 ·

0

Learning · Analysis · Gossip协议 · 目标函数 · 稳健性 ·

2022 年 8 月 4 日

Theoretical Analysis of Primal-Dual Algorithm for Non-Convex Stochastic Decentralized Optimization

翻译：理论分析非Convex储存分散化优化的原始-二元计算法的理论分析

Yuki Takezawa,Kenta Niwa,Makoto Yamada

In recent years, decentralized learning has emerged as a powerful tool not only for large-scale machine learning, but also for preserving privacy. One of the key challenges in decentralized learning is that the data distribution held by each node is statistically heterogeneous. To address this challenge, the primal-dual algorithm called the Edge-Consensus Learning (ECL) was proposed and was experimentally shown to be robust to the heterogeneity of data distributions. However, the convergence rate of the ECL is provided only when the objective function is convex, and has not been shown in a standard machine learning setting where the objective function is non-convex. Furthermore, the intuitive reason why the ECL is robust to the heterogeneity of data distributions has not been investigated. In this work, we first investigate the relationship between the ECL and Gossip algorithm and show that the update formulas of the ECL can be regarded as correcting the local stochastic gradient in the Gossip algorithm. Then, we propose the Generalized ECL (G-ECL), which contains the ECL as a special case, and provide the convergence rates of the G-ECL in both (strongly) convex and non-convex settings, which do not depend on the heterogeneity of data distributions. Through synthetic experiments, we demonstrate that the numerical results of both the G-ECL and ECL coincide with the convergence rate of the G-ECL.

翻译：近年来,分散化学习不仅成为大规模机器学习的有力工具,而且也成为保护隐私的有力工具。分散化学习的主要挑战之一是每个节点掌握的数据分布在统计上是多种多样的。为了应对这一挑战,提出了称为“边缘-consensus Learning (ECL)”的原始双算法,并实验性地证明它对于数据分布的异质性具有很强的作用。然而,ECL的趋同率只有在目标函数为Convex时才能提供,而没有在标准机学习设置中显示目标函数为非连接值。此外,没有调查ECL对数据分布的异质性具有很强的直观原因。在这项工作中,我们首先调查ECL和Gossip算法之间的关系,并表明EC的更新公式可被视为纠正Gosip算法中本地的随机梯度。然后,我们提议通用ECL(G-ECLL)的趋同性(EC-L),其中包含EC的趋同性率,而GL的趋同性率则显示GL的不及GL的趋同性。

0

相关内容

Learning

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

水稻转录因子OsMADS57参与硝酸盐调控根系伸长的机制

国家自然科学基金

0+阅读 · 2014年12月31日

化学掺杂对石墨烯量子点结构及性质的调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

几何结构形变空间的几何拓扑

国家自然科学基金

0+阅读 · 2012年12月31日

受限制策略下多臂Bandit过程的理论与应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

MicroRNA-145调节骨关节炎软骨胞外基质代谢失衡的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

氮杂环卡宾催化的新型有机反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

铟掺杂方钴矿基热电材料的电热协同输运效应及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

非对易空间和非对易相空间中的量子物理

国家自然科学基金

0+阅读 · 2009年12月31日

MicroRNA在HBV感染中作用机理的研究

国家自然科学基金

0+阅读 · 2008年12月31日

Generalizing Bayesian Optimization with Decision-theoretic Entropies

Arxiv

0+阅读 · 2022年10月4日

Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

Arxiv

0+阅读 · 2022年10月4日

Decentralized Distributed Optimization for Saddle Point Problems

Arxiv

0+阅读 · 2022年10月3日

High Probability Convergence for Accelerated Stochastic Mirror Descent

Arxiv

0+阅读 · 2022年10月3日

Privacy-preserving Decentralized Federated Learning over Time-varying Communication Graph

Arxiv

0+阅读 · 2022年10月1日

Primal-dual regression approach for Markov decision processes with general state and action space

Arxiv

0+阅读 · 2022年10月1日

Randomized quasi-optimal local approximation spaces in time

Arxiv

0+阅读 · 2022年10月1日

Pitfalls of Gaussians as a noise distribution in NCE

Arxiv

0+阅读 · 2022年10月1日

Online Multi-Agent Decentralized Byzantine-robust Gradient Estimation

Arxiv

0+阅读 · 2022年9月30日

Neighborhood Gradient Clustering: An Efficient Decentralized Learning Method for Non-IID Data Distributions

Arxiv

0+阅读 · 2022年9月30日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

《生态建模密码破译：建模与编程实践》美陆军最新报告

大模型解决方案白皮书：社交陪伴场景全流程落地指南

面向具身操作的视觉-语言-动作模型综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Generalizing Bayesian Optimization with Decision-theoretic Entropies

Arxiv

0+阅读 · 2022年10月4日

Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

Arxiv

0+阅读 · 2022年10月4日

Decentralized Distributed Optimization for Saddle Point Problems

Arxiv

0+阅读 · 2022年10月3日

High Probability Convergence for Accelerated Stochastic Mirror Descent

Arxiv

0+阅读 · 2022年10月3日

Privacy-preserving Decentralized Federated Learning over Time-varying Communication Graph

Arxiv

0+阅读 · 2022年10月1日

Primal-dual regression approach for Markov decision processes with general state and action space

Arxiv

0+阅读 · 2022年10月1日

Randomized quasi-optimal local approximation spaces in time

Arxiv

0+阅读 · 2022年10月1日

Pitfalls of Gaussians as a noise distribution in NCE

Arxiv

0+阅读 · 2022年10月1日

Online Multi-Agent Decentralized Byzantine-robust Gradient Estimation

Arxiv

0+阅读 · 2022年9月30日

Neighborhood Gradient Clustering: An Efficient Decentralized Learning Method for Non-IID Data Distributions

Arxiv

0+阅读 · 2022年9月30日

相关基金

非凸稀疏正则化模型与算法的研究

国家自然科学基金

3+阅读 · 2015年12月31日

水稻转录因子OsMADS57参与硝酸盐调控根系伸长的机制

国家自然科学基金

0+阅读 · 2014年12月31日

化学掺杂对石墨烯量子点结构及性质的调控研究

国家自然科学基金

0+阅读 · 2014年12月31日

几何结构形变空间的几何拓扑

国家自然科学基金

0+阅读 · 2012年12月31日

受限制策略下多臂Bandit过程的理论与应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

MicroRNA-145调节骨关节炎软骨胞外基质代谢失衡的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

氮杂环卡宾催化的新型有机反应研究

国家自然科学基金

0+阅读 · 2011年12月31日

铟掺杂方钴矿基热电材料的电热协同输运效应及机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

非对易空间和非对易相空间中的量子物理

国家自然科学基金

0+阅读 · 2009年12月31日

MicroRNA在HBV感染中作用机理的研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员