Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization


August 30, 2022


Zikai Xiong, Robert M. Freund

From arXiv, 27 pages, 2 figures

The Frank-Wolfe method has become increasingly useful in statistical and machine learning applications, due to the structure-inducing properties of the iterates, and especially in settings where linear minimization over the feasible set is more computationally efficient than projection. In the setting of Empirical Risk Minimization -- one of the fundamental optimization problems in statistical and machine learning -- the computational effectiveness of Frank-Wolfe methods typically grows linearly in the number of data observations $n$. This is in stark contrast to the case for typical stochastic projection methods. In order to reduce this dependence on $n$, we look to second-order smoothness of typical smooth loss functions (least squares loss and logistic loss, for example) and we propose amending the Frank-Wolfe method with Taylor series-approximated gradients, including variants for both deterministic and stochastic settings. Compared with current state-of-the-art methods in the regime where the optimality tolerance $\varepsilon$ is sufficiently small, our methods are able to simultaneously reduce the dependence on large $n$ while obtaining optimal convergence rates of Frank-Wolfe methods, in both the convex and non-convex settings. We also propose a novel adaptive step-size approach for which we have computational guarantees. Last of all, we present computational experiments which show that our methods exhibit very significant speed-ups over existing methods on real-world datasets for both convex and non-convex binary classification problems.
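The two ingredients named in the abstract can be sketched in a few lines. What follows is a minimal illustration and not the authors' algorithm: it runs the textbook Frank-Wolfe iteration on a logistic-loss ERM problem over an $\ell_1$ ball, then shows the first-order Taylor estimate of the gradient, $\nabla F(x') \approx \nabla F(x) + \nabla^2 F(x)(x' - x)$. The function names, the choice of feasible set, and the step size $2/(k+2)$ are assumptions made for this example. The Taylor helper below still loops over all $n$ observations, so it only demonstrates the approximation itself, not the cost savings the paper reports.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def logistic_loss(X, y, w):
    """Empirical risk (1/n) * sum_i log(1 + exp(-y_i * <x_i, w>)), labels in {-1, +1}."""
    return sum(math.log1p(math.exp(-yi * dot(xi, w))) for xi, yi in zip(X, y)) / len(X)

def logistic_grad(X, y, w):
    """Exact gradient of the empirical risk; touches every observation."""
    n = len(X)
    grad = [0.0] * len(w)
    for xi, yi in zip(X, y):
        # d/dz log(1 + exp(-y z)) = -y / (1 + exp(y z)), with z = <x_i, w>
        coeff = -yi / (1.0 + math.exp(yi * dot(xi, w)))
        for j, xij in enumerate(xi):
            grad[j] += coeff * xij / n
    return grad

def lmo_l1_ball(grad, radius):
    """Linear minimization oracle: argmin of <grad, s> over the l1 ball (a signed vertex)."""
    j = max(range(len(grad)), key=lambda k: abs(grad[k]))
    s = [0.0] * len(grad)
    s[j] = -radius if grad[j] > 0 else radius
    return s

def frank_wolfe(X, y, radius=1.0, iters=50):
    """Textbook Frank-Wolfe with the open-loop step size 2 / (k + 2) (an assumed choice)."""
    w = [0.0] * len(X[0])
    for k in range(iters):
        s = lmo_l1_ball(logistic_grad(X, y, w), radius)
        gamma = 2.0 / (k + 2)
        # Convex combination keeps the iterate inside the feasible set; no projection needed.
        w = [(1.0 - gamma) * wj + gamma * sj for wj, sj in zip(w, s)]
    return w

def taylor_grad(X, w_ref, grad_ref, w_new):
    """First-order Taylor estimate of the gradient at w_new, expanded around w_ref.

    Uses the logistic-loss curvature sigma(z) * (1 - sigma(z)), which does not
    depend on the label when labels are in {-1, +1}.
    """
    n = len(X)
    delta = [a - b for a, b in zip(w_new, w_ref)]
    est = list(grad_ref)
    for xi in X:
        p = 1.0 / (1.0 + math.exp(-dot(xi, w_ref)))
        scale = p * (1.0 - p) * dot(xi, delta) / n
        for j, xij in enumerate(xi):
            est[j] += scale * xij
    return est
```

For a short step such as the Frank-Wolfe update $x' = x + \gamma (s - x)$, the Taylor estimate is typically much closer to the true gradient at $x'$ than the stale gradient at $x$, since its error is second order in the step length.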


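Related topic: Empirical risk minimization

Empirical risk minimization (ERM) is a principle in statistical learning theory that defines a family of learning algorithms and is used to derive theoretical bounds on their performance. Under the ERM strategy, the model with the smallest empirical risk is taken to be the best model, so finding the best model amounts to solving an optimization problem.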
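As a worked form of this definition (the symbols below are illustrative assumptions, not taken from the page): given $n$ observations $(w_i, y_i)$, a loss function $\ell$, and a feasible set $\mathcal{C}$, ERM chooses the parameter vector $x$ that solves

$$\min_{x \in \mathcal{C}} \; F(x) := \frac{1}{n} \sum_{i=1}^{n} \ell\big(w_i^\top x,\, y_i\big).$$

For the least squares loss, $\ell(z, y) = \tfrac{1}{2}(z - y)^2$; for the logistic loss with labels $y \in \{-1, +1\}$, $\ell(z, y) = \log(1 + e^{-yz})$. Evaluating $F$ or its gradient exactly touches all $n$ terms, which is where a per-iteration cost linear in $n$ comes from.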

