On High-dimensional and Low-rank Tensor Bandits

Bandits · Tensor · Estimation/Estimator · Constraint · Subspace

May 6, 2023

Chengshuai Shi,Cong Shen,Nicholas D. Sidiropoulos

From arXiv. Accepted to the 2023 IEEE International Symposium on Information Theory (ISIT 2023).

Most existing studies on linear bandits focus on the one-dimensional characterization of the overall system. While representative, this formulation may fail to model applications with high-dimensional but favorable structures, such as the low-rank tensor representation for recommender systems. To address this limitation, this work studies a general tensor bandits model, where actions and system parameters are represented by tensors as opposed to vectors, with a particular focus on the case where the unknown system tensor is low-rank. A novel bandit algorithm, coined TOFU (Tensor Optimism in the Face of Uncertainty), is developed. TOFU first leverages flexible tensor regression techniques to estimate low-dimensional subspaces associated with the system tensor. These estimates are then used to convert the original problem into a new one with norm constraints on its system parameters. Lastly, TOFU adopts a norm-constrained bandit subroutine, which uses these constraints to avoid exploring the entire high-dimensional parameter space. Theoretical analyses show that TOFU improves the best-known regret upper bound by a multiplicative factor that grows exponentially in the system order. A novel performance lower bound is also established, which further corroborates the efficiency of TOFU.
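The conversion step described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a Tucker-style low-rank system tensor, a reward given by the tensor inner product of action and system tensor, and it swaps TOFU's tensor regression for a plain SVD of each mode unfolding of a tensor that is simply handed to the function. All function names (`unfold`, `low_rank_tensor`, `estimate_bases`, `rotate`, `tail_energy`) are hypothetical. The point it shows: after rotating into the estimated subspaces, the reward is unchanged, and the system tensor's energy sits in a small leading block, so the remaining coordinates have a small norm.

```python
import numpy as np

def unfold(tensor, mode):
    """Mode-n unfolding: bring `mode` to the front and flatten the other axes."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def mode_product(tensor, matrix, mode):
    """Multiply `matrix` (shape p x d_mode) into `tensor` along axis `mode`."""
    out = np.tensordot(matrix, tensor, axes=(1, mode))
    return np.moveaxis(out, 0, mode)

def low_rank_tensor(dims, rank, rng):
    """Assumed model: random tensor with multilinear rank (rank, ..., rank)."""
    tensor = rng.standard_normal((rank,) * len(dims))  # small core tensor
    for mode, d in enumerate(dims):
        factor, _ = np.linalg.qr(rng.standard_normal((d, rank)))
        tensor = mode_product(tensor, factor, mode)
    return tensor

def estimate_bases(tensor_hat):
    """Stand-in for subspace estimation: a full orthonormal basis per mode,
    ordered so the leading columns span the dominant subspace."""
    bases = []
    for mode in range(tensor_hat.ndim):
        u, _, _ = np.linalg.svd(unfold(tensor_hat, mode), full_matrices=True)
        bases.append(u)
    return bases

def rotate(tensor, bases):
    """Express `tensor` in the coordinates of `bases` (apply U^T on each mode)."""
    for mode, u in enumerate(bases):
        tensor = mode_product(tensor, u.T, mode)
    return tensor

def tail_energy(rotated, rank):
    """Squared norm of all entries outside the leading rank x ... x rank block."""
    tail = rotated.copy()
    tail[(slice(0, rank),) * rotated.ndim] = 0.0
    return float(np.sum(tail ** 2))

rng = np.random.default_rng(0)
system = low_rank_tensor((5, 6, 7), rank=2, rng=rng)   # unknown in a real bandit
action = rng.standard_normal((5, 6, 7))

bases = estimate_bases(system)        # noiseless here; TOFU would use regression
system_rot = rotate(system, bases)
action_rot = rotate(action, bases)

print("reward before rotation:", np.sum(system * action))
print("reward after rotation: ", np.sum(system_rot * action_rot))
print("fraction of energy outside the leading block:",
      tail_energy(system_rot, 2) / np.sum(system ** 2))
```

Because the rotation is orthogonal along every mode, the inner-product reward is preserved, so the rotated problem is equivalent to the original one. With noisy subspace estimates the energy outside the leading block would be small rather than essentially zero, which is the kind of norm constraint a norm-constrained bandit subroutine could then exploit.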
