February 20, 2023

High-dimensional Central Limit Theorems for Linear Functionals of Online Least-Squares SGD

Bhavya Agrawalla, Krishnakumar Balasubramanian, Promit Ghosal

Stochastic gradient descent (SGD) has emerged as the quintessential method in a data scientist's toolbox. Much progress has been made in the last two decades toward understanding the iteration complexity of SGD (in expectation and high-probability) in the learning theory and optimization literature. However, using SGD for high-stakes applications requires careful quantification of the associated uncertainty. Toward that end, in this work, we establish high-dimensional Central Limit Theorems (CLTs) for linear functionals of online least-squares SGD iterates under a Gaussian design assumption. Our main result shows that a CLT holds even when the dimensionality is of order exponential in the number of iterations of the online SGD, thereby enabling high-dimensional inference with online SGD. Our proof technique involves leveraging Berry-Esseen bounds developed for martingale difference sequences and carefully evaluating the required moment and quadratic variation terms through recent advances in concentration inequalities for product random matrices. We also provide an online approach for estimating the variance appearing in the CLT (required for constructing confidence intervals in practice) and establish consistency results in the high-dimensional setting.
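To make the setting in the abstract concrete, the following is a minimal simulation sketch, not the authors' method. It runs online least-squares SGD with a fresh Gaussian sample at every step and records a linear functional of the final iterate across independent runs. The constant step size, identity covariance, dimension, noise level, and function names are all illustrative assumptions. The standardization uses the Monte Carlo mean and standard deviation over replications, which is a stand-in and not the online variance estimator the paper proposes.

```python
import numpy as np

def online_least_squares_sgd(theta_star, n_steps, step_size, noise_std, rng):
    """One pass of online least-squares SGD, drawing a fresh sample per step."""
    d = theta_star.shape[0]
    theta = np.zeros(d)
    for _ in range(n_steps):
        x = rng.standard_normal(d)  # Gaussian design (assumed x ~ N(0, I_d))
        y = x @ theta_star + noise_std * rng.standard_normal()
        # Gradient step on the per-sample loss 0.5 * (y - x^T theta)^2.
        theta = theta + step_size * (y - x @ theta) * x
    return theta

def simulate_linear_functional(a, theta_star, n_steps, step_size,
                               noise_std, n_runs, seed=0):
    """Return <a, theta_T> from n_runs independent SGD runs."""
    rng = np.random.default_rng(seed)
    return np.array([
        a @ online_least_squares_sgd(theta_star, n_steps, step_size,
                                     noise_std, rng)
        for _ in range(n_runs)
    ])

# Illustrative (assumed) problem sizes, chosen only so the sketch runs quickly.
d = 20
theta_star = np.ones(d) / np.sqrt(d)
a = np.zeros(d)
a[0] = 1.0  # functional that picks out the first coordinate

values = simulate_linear_functional(a, theta_star, n_steps=500,
                                    step_size=0.01, noise_std=0.5, n_runs=300)

# Standardize with Monte Carlo moments and check how often |z| < 1.96.
z = (values - values.mean()) / values.std()
coverage = np.mean(np.abs(z) < 1.96)
print(f"mean functional: {values.mean():.4f}, target: {a @ theta_star:.4f}")
print(f"fraction of standardized values within 1.96: {coverage:.3f}")
```

If the functional is approximately Gaussian, the reported fraction should sit near 0.95. In practice only one SGD run is available, which is why an estimator of the variance computed along a single trajectory, as the abstract describes, is needed to build confidence intervals.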

