通过斯托查斯梯梯级后裔对背景强盗的在线统计推论 (Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent) - 专知论文

会员服务 ·

0

上下文赌博机/上下文老虎机 · 赌博机/老虎机 · 随机梯度下降 · 统计量 · 估计/估计量 ·

2022 年 12 月 30 日

Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent

翻译：通过斯托查斯梯梯级后裔对背景强盗的在线统计推论

Xi Chen,Zehua Lai,He Li,Yichen Zhang

With the fast development of big data, it has been easier than before to learn the optimal decision rule by updating the decision rule recursively and making online decisions. We study the online statistical inference of model parameters in a contextual bandit framework of sequential decision-making. We propose a general framework for online and adaptive data collection environment that can update decision rules via weighted stochastic gradient descent. We allow different weighting schemes of the stochastic gradient and establish the asymptotic normality of the parameter estimator. Our proposed estimator significantly improves the asymptotic efficiency over the previous averaged SGD approach via inverse probability weights. We also conduct an optimality analysis on the weights in a linear regression setting. We provide a Bahadur representation of the proposed estimator and show that the remainder term in the Bahadur representation entails a slower convergence rate compared to classical SGD due to the adaptive data collection.

翻译：随着海量数据的快速发展,通过更新回溯性决定规则并作出在线决定来学习最佳决策规则比以往容易得多。我们研究了在相继决策的背景土匪框架中模型参数的在线统计推论。我们提出了在线和适应性数据收集环境的总体框架,该框架可以通过加权随机梯度梯度下降更新决策规则。我们允许随机梯度的不同加权办法,并建立了参数测深器的无症状常态。我们提议的测深器通过反概率权重,大大提高了先前的平均 SGD 方法的无症状效率。我们还对线性回归环境中的权重进行了最佳分析。我们提供了拟议估算器的巴哈杜尔语代表,并表明由于适应性数据收集,巴哈杜尔语代表的剩余任期与古典 SGD 的融合速度要慢。

0

相关内容

上下文赌博机/上下文老虎机

上下文赌博机/上下文老虎机

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

分数排斥统计下低维相互作用量子气体的输运性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ca2+/Cofilin信号通路在电刺激促进神经元突起再生中的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

气体在纳米孔内流动与换热的实验与分子动力学模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

二氧化硫/水/空气体系离子诱导成核机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

疏肝益肾方抗乳腺癌内分泌治疗耐药的增效机制

国家自然科学基金

0+阅读 · 2011年12月31日

磁性阻挫ABO3型锰氧化物中的磁电耦合及多铁性质的研究

国家自然科学基金

0+阅读 · 2009年12月31日

下地幔条件下MgO-SiO2-FeO-CaO熔体结构与热力学性质的多尺度计算

国家自然科学基金

0+阅读 · 2009年12月31日

量子微结构中的电子-光子耦合激发

国家自然科学基金

0+阅读 · 2008年12月31日

High Probability Convergence of Stochastic Gradient Methods

Arxiv

0+阅读 · 2023年2月28日

Particle-based Online Bayesian Sampling

Arxiv

0+阅读 · 2023年2月28日

Privacy of Noisy Stochastic Gradient Descent: More Iterations without More Privacy Loss

Privacy of Noisy Stochastic Gradient Descent: More Iterations without More Privacy Loss

Arxiv

0+阅读 · 2023年2月28日

Design-Based Inference for Multi-arm Bandits

Arxiv

0+阅读 · 2023年2月27日

Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions

Arxiv

0+阅读 · 2023年2月27日

Prediction-based Variable Selection for Component-wise Gradient Boosting

Arxiv

0+阅读 · 2023年2月27日

On the influence of stochastic roundoff errors and their bias on the convergence of the gradient descent method with low-precision floating-point computation

Arxiv

0+阅读 · 2023年2月25日

Data-driven uncertainty quantification for constrained stochastic differential equations and application to solar photovoltaic power forecast data

Arxiv

0+阅读 · 2023年2月25日

Statistical Inference with Stochastic Gradient Methods under $φ$-mixing Data

Arxiv

0+阅读 · 2023年2月24日

Preferential Subsampling for Stochastic Gradient Langevin Dynamics

Arxiv

0+阅读 · 2023年2月24日

VIP会员

文章信息

相关主题

上下文赌博机/上下文老虎机

赌博机/老虎机

随机梯度下降

估计/估计量

相关VIP内容

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

【干货书】数据分析优化，Optimization for Modern Data Analysis，117页pdf

专知会员服务

63+阅读 · 2023年2月15日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

244+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《无人机战争时代的战时法：大国竞争中的区分原则、相称性原则与行动建议》最新75页

《构建强健军事力量的设计挑战：提升海军兵力支持系统效能的多分辨率建模方法》69页

正视无人机心理战：恐惧效应与战略反思

《精确反蜂群防御系统：三维运动探测与定向空爆拦截技术融合》最新24页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

High Probability Convergence of Stochastic Gradient Methods

Arxiv

0+阅读 · 2023年2月28日

Particle-based Online Bayesian Sampling

Arxiv

0+阅读 · 2023年2月28日

Privacy of Noisy Stochastic Gradient Descent: More Iterations without More Privacy Loss

Privacy of Noisy Stochastic Gradient Descent: More Iterations without More Privacy Loss

Arxiv

0+阅读 · 2023年2月28日

Design-Based Inference for Multi-arm Bandits

Arxiv

0+阅读 · 2023年2月27日

Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions

Arxiv

0+阅读 · 2023年2月27日

Prediction-based Variable Selection for Component-wise Gradient Boosting

Arxiv

0+阅读 · 2023年2月27日

On the influence of stochastic roundoff errors and their bias on the convergence of the gradient descent method with low-precision floating-point computation

Arxiv

0+阅读 · 2023年2月25日

Data-driven uncertainty quantification for constrained stochastic differential equations and application to solar photovoltaic power forecast data

Arxiv

0+阅读 · 2023年2月25日

Statistical Inference with Stochastic Gradient Methods under $φ$-mixing Data

Arxiv

0+阅读 · 2023年2月24日

Preferential Subsampling for Stochastic Gradient Langevin Dynamics

Arxiv

0+阅读 · 2023年2月24日

相关基金

肝细胞肝癌中高表达的PRC1基因功能及其受CTCF调控的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

分数排斥统计下低维相互作用量子气体的输运性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ca2+/Cofilin信号通路在电刺激促进神经元突起再生中的作用及其机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

气体在纳米孔内流动与换热的实验与分子动力学模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

二氧化硫/水/空气体系离子诱导成核机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

疏肝益肾方抗乳腺癌内分泌治疗耐药的增效机制

国家自然科学基金

0+阅读 · 2011年12月31日

磁性阻挫ABO3型锰氧化物中的磁电耦合及多铁性质的研究

国家自然科学基金

0+阅读 · 2009年12月31日

下地幔条件下MgO-SiO2-FeO-CaO熔体结构与热力学性质的多尺度计算

国家自然科学基金

0+阅读 · 2009年12月31日

量子微结构中的电子-光子耦合激发

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员