KKHS 强盗的理论基础方法 (Approximation Theory Based Methods for RKHS Bandits) - 专知论文

会员服务 ·

0

赌博机/老虎机 · 近似 · Extensibility · CC · 线性的 ·

2021 年 7 月 26 日

Approximation Theory Based Methods for RKHS Bandits

翻译：KKHS 强盗的理论基础方法

Sho Takemori,Masahiro Sato

The RKHS bandit problem (also called kernelized multi-armed bandit problem) is an online optimization problem of non-linear functions with noisy feedback. Although the problem has been extensively studied, there are unsatisfactory results for some problems compared to the well-studied linear bandit case. Specifically, there is no general algorithm for the adversarial RKHS bandit problem. In addition, high computational complexity of existing algorithms hinders practical application. We address these issues by considering a novel amalgamation of approximation theory and the misspecified linear bandit problem. Using an approximation method, we propose efficient algorithms for the stochastic RKHS bandit problem and the first general algorithm for the adversarial RKHS bandit problem. Furthermore, we empirically show that one of our proposed methods has comparable cumulative regret to IGP-UCB and its running time is much shorter.

翻译：RKHS 土匪问题(也称为内部多武装土匪问题) 是非线性功能的在线优化问题, 且有吵闹的反馈。虽然这个问题已经进行了广泛研究, 但与研究周密的线性土匪案件相比,有些问题的结果并不令人满意。具体地说, 敌对的RKHS 土匪问题没有通用算法。此外, 现有算法的计算复杂性阻碍了实际应用。我们通过考虑将近似理论和错误指定的线性土匪问题进行新颖的合并来解决这些问题。我们使用近似法, 提出了关于Stochatic RKHS 土匪问题的有效算法, 以及敌性RKHS 土匪问题的第一个总算法。此外, 我们从经验上表明, 我们提出的方法之一与IGP- UCB 累积了同样的遗憾, 其运行时间要短得多。

0

相关内容

赌博机/老虎机

赌博机/老虎机

【经典书】主动学习理论，226页pdf，Theory of Active Learning

【经典书】主动学习理论，226页pdf，Theory of Active Learning

专知会员服务

127+阅读 · 2021年7月14日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

泡泡机器人SLAM

4+阅读 · 2018年2月11日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Optimal policy evaluation using kernel-based temporal difference methods

Arxiv

0+阅读 · 2021年9月24日

Sparse tensor product approximation for a class of generalized method of moments estimators

Arxiv

0+阅读 · 2021年9月24日

Regret Lower Bound and Optimal Algorithm for High-Dimensional Contextual Linear Bandit

Arxiv

0+阅读 · 2021年9月23日

Outlier-Robust Sparse Estimation via Non-Convex Optimization

Arxiv

0+阅读 · 2021年9月23日

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Arxiv

0+阅读 · 2021年9月23日

Weighted Low Rank Matrix Approximation and Acceleration

Arxiv

0+阅读 · 2021年9月22日

On Optimal Robustness to Adversarial Corruption in Online Decision Problems

Arxiv

0+阅读 · 2021年9月22日

Bounds on approximating Max $k$XOR with quantum and classical local algorithms

Arxiv

0+阅读 · 2021年9月22日

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

Arxiv

5+阅读 · 2020年4月2日

Variance Reduction Methods for Sublinear Reinforcement Learning

Arxiv

4+阅读 · 2018年4月25日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

【经典书】主动学习理论，226页pdf，Theory of Active Learning

【经典书】主动学习理论，226页pdf，Theory of Active Learning

专知会员服务

127+阅读 · 2021年7月14日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

【机器学习基础最新版】（Mathematics for Machine Learning），417页pdf

专知会员服务

246+阅读 · 2019年10月21日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

【斯坦福博士论文】数据、决策与依赖：构建可信人工智能的挑战

人工智能时代背景下的未来海战

接触战中的无人机优势：美军旅级部队面临的小型无人机系统挑战与调整

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

【泡泡一分钟】一种基于光场的快速有效深度图估计方法（3dv-43）

泡泡机器人SLAM

4+阅读 · 2018年2月11日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Optimal policy evaluation using kernel-based temporal difference methods

Arxiv

0+阅读 · 2021年9月24日

Sparse tensor product approximation for a class of generalized method of moments estimators

Arxiv

0+阅读 · 2021年9月24日

Regret Lower Bound and Optimal Algorithm for High-Dimensional Contextual Linear Bandit

Arxiv

0+阅读 · 2021年9月23日

Outlier-Robust Sparse Estimation via Non-Convex Optimization

Arxiv

0+阅读 · 2021年9月23日

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Arxiv

0+阅读 · 2021年9月23日

Weighted Low Rank Matrix Approximation and Acceleration

Arxiv

0+阅读 · 2021年9月22日

On Optimal Robustness to Adversarial Corruption in Online Decision Problems

Arxiv

0+阅读 · 2021年9月22日

Bounds on approximating Max $k$XOR with quantum and classical local algorithms

Arxiv

0+阅读 · 2021年9月22日

Hierarchical Adaptive Contextual Bandits for Resource Constraint based Recommendation

Arxiv

5+阅读 · 2020年4月2日

Variance Reduction Methods for Sublinear Reinforcement Learning

Arxiv

4+阅读 · 2018年4月25日

微信扫码咨询专知VIP会员