GBOSE: 通用大盗矫形正形半对称估计 (GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation) - 专知论文

会员服务 ·

0

赌博机/老虎机 · state-of-the-art · 估计/估计量 · 正交 · MoDELS ·

2023 年 1 月 20 日

GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation

翻译：GBOSE: 通用大盗矫形正形半对称估计

Mubarrat Chowdhury,Elkhan Ismayilzada,Khalequzzaman Sayem,Gi-Soo Kim

In sequential decision-making scenarios i.e., mobile health recommendation systems revenue management contextual multi-armed bandit algorithms have garnered attention for their performance. But most of the existing algorithms are built on the assumption of a strictly parametric reward model mostly linear in nature. In this work we propose a new algorithm with a semi-parametric reward model with state-of-the-art complexity of upper bound on regret amongst existing semi-parametric algorithms. Our work expands the scope of another representative algorithm of state-of-the-art complexity with a similar reward model by proposing an algorithm built upon the same action filtering procedures but provides explicit action selection distribution for scenarios involving more than two arms at a particular time step while requiring fewer computations. We derive the said complexity of the upper bound on regret and present simulation results that affirm our methods superiority out of all prevalent semi-parametric bandit algorithms for cases involving over two arms.

翻译：在一系列决策假设中,即移动式保健建议系统收入管理,多武装强盗算法在其性能方面引起了注意。但大多数现有算法都是建立在假设严格的参数性奖赏模式的基础之上,其中大部分是线性性质的。在这项工作中,我们提出了一种新的算法,其半参数性奖赏模式具有最先进的复杂程度,在现有半参数性算法中,其上层的遗憾程度具有最先进的复杂程度。我们的工作扩大了另一个具有类似奖赏模式的现代复杂程度的代表性算法的范围,即提议一种基于同一行动过滤程序的算法,但为在特定时间步骤中涉及两个以上武器的情景提供明确的行动选择分布,同时要求较少的计算。我们得出了所述在遗憾上层的复杂程度,并提出模拟结果,确认我们的方法优于所有涉及两个武器的案件的流行的半参数性强算法。

0

相关内容

赌博机/老虎机

赌博机/老虎机

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

多酚结构特性、相互作用和抗氧化效应（协同、拮抗或加成）间的关系

国家自然科学基金

0+阅读 · 2014年12月31日

蓖麻矮化相关RcDof基因功能分析及调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非高斯型连续变量纠缠态的非高斯调控及其纠缠性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

嗜盐古菌CRISPR/Cas系统与基因组稳定性机制

国家自然科学基金

0+阅读 · 2012年12月31日

纳米聚集态金属有机骨架材料的可控制备及形成机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限维Banach几何与关于凸体覆盖的Hadwiger猜想

国家自然科学基金

0+阅读 · 2012年12月31日

Gata6对血管损伤修复和动脉粥样硬化形成的作用及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

从内质网应激信号通路研究加味阳和汤对骨性关节炎的早期作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

广义Kloosterman和的均值估计

国家自然科学基金

1+阅读 · 2011年12月31日

渗流及相关随机系统的极限行为研究

国家自然科学基金

0+阅读 · 2009年12月31日

Estimation of continuous environments by robot swarms: Correlated networks and decision-making

Arxiv

0+阅读 · 2023年3月15日

Transferability Estimation Based On Principal Gradient Expectation

Arxiv

0+阅读 · 2023年3月15日

Quantum Steering Algorithm for Estimating Fidelity of Separability

Arxiv

0+阅读 · 2023年3月14日

A Unified BEV Model for Joint Learning of 3D Local Features and Overlap Estimation

Arxiv

0+阅读 · 2023年3月14日

High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection

Arxiv

0+阅读 · 2023年3月14日

Parametric Estimation of Tempered Stable Laws

Arxiv

0+阅读 · 2023年3月13日

Contrastive Representation Learning for Acoustic Parameter Estimation

Arxiv

0+阅读 · 2023年3月13日

Estimating a potential without the agony of the partition function

Arxiv

0+阅读 · 2023年3月11日

Privacy-Preserving and Lossless Distributed Estimation of High-Dimensional Generalized Additive Mixed Models

Arxiv

0+阅读 · 2023年3月10日

Machine Learning-based Framework for Optimally Solving the Analytical Inverse Kinematics for Redundant Manipulators

Arxiv

0+阅读 · 2023年3月9日

VIP会员

文章信息

相关主题

赌博机/老虎机

state-of-the-art

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津大学博士论文】将序列结构与几何结构融入深度神经网络

工程视角：影响战争进程的小型无人机

企业级AI应用开发：从技术选型到生产落地

AI生成代码缺陷综述

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Estimation of continuous environments by robot swarms: Correlated networks and decision-making

Arxiv

0+阅读 · 2023年3月15日

Transferability Estimation Based On Principal Gradient Expectation

Arxiv

0+阅读 · 2023年3月15日

Quantum Steering Algorithm for Estimating Fidelity of Separability

Arxiv

0+阅读 · 2023年3月14日

A Unified BEV Model for Joint Learning of 3D Local Features and Overlap Estimation

Arxiv

0+阅读 · 2023年3月14日

High-Dimensional Dynamic Pricing under Non-Stationarity: Learning and Earning with Change-Point Detection

Arxiv

0+阅读 · 2023年3月14日

Parametric Estimation of Tempered Stable Laws

Arxiv

0+阅读 · 2023年3月13日

Contrastive Representation Learning for Acoustic Parameter Estimation

Arxiv

0+阅读 · 2023年3月13日

Estimating a potential without the agony of the partition function

Arxiv

0+阅读 · 2023年3月11日

Privacy-Preserving and Lossless Distributed Estimation of High-Dimensional Generalized Additive Mixed Models

Arxiv

0+阅读 · 2023年3月10日

Machine Learning-based Framework for Optimally Solving the Analytical Inverse Kinematics for Redundant Manipulators

Arxiv

0+阅读 · 2023年3月9日

相关基金

多酚结构特性、相互作用和抗氧化效应（协同、拮抗或加成）间的关系

国家自然科学基金

0+阅读 · 2014年12月31日

蓖麻矮化相关RcDof基因功能分析及调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非高斯型连续变量纠缠态的非高斯调控及其纠缠性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

嗜盐古菌CRISPR/Cas系统与基因组稳定性机制

国家自然科学基金

0+阅读 · 2012年12月31日

纳米聚集态金属有机骨架材料的可控制备及形成机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限维Banach几何与关于凸体覆盖的Hadwiger猜想

国家自然科学基金

0+阅读 · 2012年12月31日

Gata6对血管损伤修复和动脉粥样硬化形成的作用及其机制

国家自然科学基金

0+阅读 · 2011年12月31日

从内质网应激信号通路研究加味阳和汤对骨性关节炎的早期作用机制

国家自然科学基金

0+阅读 · 2011年12月31日

广义Kloosterman和的均值估计

国家自然科学基金

1+阅读 · 2011年12月31日

渗流及相关随机系统的极限行为研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员