Sparsity-Agnostic Lasso Bandit - Zhuanzhi Papers

Bandits · Specialization · Performer · Contextual bandits · Sparsity

April 28, 2021

Sparsity-Agnostic Lasso Bandit

Min-hwan Oh, Garud Iyengar, Assaf Zeevi

We consider a stochastic contextual bandit problem in which the dimension $d$ of the feature vectors is potentially large, but only a sparse subset of features of cardinality $s_0 \ll d$ affects the reward function. Essentially all existing algorithms for sparse bandits require a priori knowledge of the value of the sparsity index $s_0$. This knowledge is almost never available in practice, and misspecification of this parameter can lead to severe deterioration in the performance of existing methods. The main contribution of this paper is to propose an algorithm that does not require prior knowledge of the sparsity index $s_0$ and to establish tight regret bounds on its performance under mild conditions. We also comprehensively evaluate our proposed algorithm numerically and show that it consistently outperforms existing methods, even when the correct sparsity index is revealed to them but kept hidden from our algorithm.
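To make the setting in the abstract concrete, here is a minimal simulation sketch of a sparse linear contextual bandit in which the learner never sees $s_0$. The abstract does not spell out the algorithm, so everything below is an assumption for illustration: the greedy arm choice, the ISTA Lasso solver, the regularization schedule `lam_t`, and all function and parameter names are hypothetical and should not be read as the authors' method.

```python
import numpy as np


def soft_threshold(z, tau):
    """Elementwise soft-thresholding, the proximal operator of tau * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - tau, 0.0)


def lasso_ista(X, y, lam, beta0, n_iter=200):
    """Approximately minimise (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 with ISTA."""
    n = X.shape[0]
    # Lipschitz constant of the gradient of the smooth (least-squares) part.
    lipschitz = np.linalg.norm(X, 2) ** 2 / n
    if lipschitz == 0.0:
        return beta0
    beta = beta0.copy()
    for _ in range(n_iter):
        grad = X.T @ (X @ beta - y) / n
        beta = soft_threshold(beta - grad / lipschitz, lam / lipschitz)
    return beta


def run_greedy_lasso_bandit(T=300, d=50, K=5, s0=3, lam0=0.1, noise=0.1, seed=0):
    """Simulate a greedy Lasso-based policy on a synthetic sparse bandit.

    s0 is used only by the simulator to build the true parameter;
    the learner's updates never reference it.
    """
    rng = np.random.default_rng(seed)
    beta_true = np.zeros(d)
    beta_true[:s0] = rng.uniform(0.5, 1.0, size=s0)

    beta_hat = np.zeros(d)
    X_hist, y_hist, regret = [], [], []
    for t in range(1, T + 1):
        contexts = rng.normal(size=(K, d))         # one feature vector per arm
        arm = int(np.argmax(contexts @ beta_hat))  # greedy choice (assumption)
        means = contexts @ beta_true
        reward = means[arm] + noise * rng.normal()

        X_hist.append(contexts[arm])
        y_hist.append(reward)
        regret.append(means.max() - means[arm])

        # Assumed schedule: depends on t and d only, not on s0.
        lam_t = lam0 * np.sqrt((np.log(t) + np.log(d)) / t)
        beta_hat = lasso_ista(np.array(X_hist), np.array(y_hist), lam_t, beta_hat)
    return beta_hat, np.cumsum(regret)


if __name__ == "__main__":
    beta_hat, cum_regret = run_greedy_lasso_bandit()
    print("nonzero coefficients in estimate:", int(np.count_nonzero(beta_hat)))
    print("cumulative regret after T rounds:", float(cum_regret[-1]))
```

The point of the sketch is the interface rather than the performance: the only tuning input on the learner's side is `lam0`, whereas the methods the abstract contrasts against would also need the sparsity index as an input.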

