Cooperative Thresholded Lasso for Sparse Linear Bandit - 专知论文

会员服务 ·

0

赌博机/老虎机 · 线性的 · Agent · 稀疏 · Networking ·

2023 年 5 月 30 日

Cooperative Thresholded Lasso for Sparse Linear Bandit

翻译：暂无翻译

Haniyeh Barghi,Xiaotong Cheng,Setareh Maghsudi

We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension $d$ whereas the reward function depends on only a limited set of features - precisely $s_0 \ll d$. Furthermore, the learning follows under information-sharing constraints. The proposed method employs Lasso regression for dimension reduction, allowing each agent to independently estimate an approximate set of main dimensions and share that information with others depending on the network's structure. The information is then aggregated through a specific process and shared with all agents. Each agent then resolves the problem with ridge regression focusing solely on the extracted dimensions. We represent algorithms for both a star-shaped network and a peer-to-peer network. The approaches effectively reduce communication costs while ensuring minimal cumulative regret per agent. Theoretically, we show that our proposed methods have a regret bound of order $\mathcal{O}(s_0 \log d + s_0 \sqrt{T})$ with high probability, where $T$ is the time horizon. To our best knowledge, it is the first algorithm that tackles row-wise distributed data in sparse linear bandits, achieving comparable performance compared to the state-of-the-art single and multi-agent methods. Besides, it is widely applicable to high-dimensional multi-agent problems where efficient feature extraction is critical for minimizing regret. To validate the effectiveness of our approach, we present experimental results on both synthetic and real-world datasets.

翻译：暂无翻译

0

相关内容

赌博机/老虎机

赌博机/老虎机

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

网络环境下非线性互联大系统的模糊双曲建模和鲁棒控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

国债收益率曲线与国债期货的互动关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

受限制策略下多臂Bandit过程的理论与应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

一类信用衍生品的定价和优化控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

贝叶斯框架下风险度量的非参数估计及其应用研究

国家自然科学基金

1+阅读 · 2012年12月31日

Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

Arxiv

0+阅读 · 2023年7月18日

Minimum Target Sets in Non-Progressive Threshold Models: When Timing Matters

Arxiv

0+阅读 · 2023年7月18日

Minimax Rates for High-dimensional Double Sparse Structure over $\ell_u(\ell_q)$-balls

Arxiv

0+阅读 · 2023年7月18日

A $(3/2 + \varepsilon)$-Approximation for Multiple TSP with a Variable Number of Depots

Arxiv

0+阅读 · 2023年7月14日

Optimal Symmetric Strategies in Multi-Agent Systems with Decentralized Information

Arxiv

0+阅读 · 2023年7月14日

VIP会员

文章信息

相关主题

赌博机/老虎机

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大模型推理时代的知识编辑

《利用人工智能对军事行动进行建模》

【MIT博士论文】加速科学发现的因果建模实践算法

机器人、无人机与实时影像：应对城市爆炸威胁的三大技术方案

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Sparse Gaussian Graphical Models with Discrete Optimization: Computational and Statistical Perspectives

Arxiv

0+阅读 · 2023年7月18日

Minimum Target Sets in Non-Progressive Threshold Models: When Timing Matters

Arxiv

0+阅读 · 2023年7月18日

Minimax Rates for High-dimensional Double Sparse Structure over $\ell_u(\ell_q)$-balls

Arxiv

0+阅读 · 2023年7月18日

A $(3/2 + \varepsilon)$-Approximation for Multiple TSP with a Variable Number of Depots

Arxiv

0+阅读 · 2023年7月14日

Optimal Symmetric Strategies in Multi-Agent Systems with Decentralized Information

Arxiv

0+阅读 · 2023年7月14日

相关基金

网络环境下非线性互联大系统的模糊双曲建模和鲁棒控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

国债收益率曲线与国债期货的互动关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

受限制策略下多臂Bandit过程的理论与应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

一类信用衍生品的定价和优化控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

贝叶斯框架下风险度量的非参数估计及其应用研究

国家自然科学基金

1+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员