检查制度下的强盗问题的有效影响 (Effective Dimension in Bandit Problems under Censorship) - 专知论文

会员服务 ·

0

赌博机/老虎机 · Analysis · 上下文赌博机/上下文老虎机 · 损失 · Continuity ·

2023 年 2 月 14 日

Effective Dimension in Bandit Problems under Censorship

翻译：检查制度下的强盗问题的有效影响

Gauthier Guinet,Saurabh Amin,Patrick Jaillet

from arxiv, 45 pages, 5 figures, NeurIPS 2022

In this paper, we study both multi-armed and contextual bandit problems in censored environments. Our goal is to estimate the performance loss due to censorship in the context of classical algorithms designed for uncensored environments. Our main contributions include the introduction of a broad class of censorship models and their analysis in terms of the effective dimension of the problem -- a natural measure of its underlying statistical complexity and main driver of the regret bound. In particular, the effective dimension allows us to maintain the structure of the original problem at first order, while embedding it in a bigger space, and thus naturally leads to results analogous to uncensored settings. Our analysis involves a continuous generalization of the Elliptical Potential Inequality, which we believe is of independent interest. We also discover an interesting property of decision-making under censorship: a transient phase during which initial misspecification of censorship is self-corrected at an extra cost, followed by a stationary phase that reflects the inherent slowdown of learning governed by the effective dimension. Our results are useful for applications of sequential decision-making models where the feedback received depends on strategic uncertainty (e.g., agents' willingness to follow a recommendation) and/or random uncertainty (e.g., loss or delay in arrival of information).

翻译：在本文中,我们研究了受审查环境中的多武装和背景土匪问题。我们的目标是根据为不受审查环境设计的古典算法来估计由于审查而导致的绩效损失。我们的主要贡献包括采用广泛的审查模式,并分析问题的有效层面 -- -- 其内在统计复杂性的自然度和造成遗憾的主要驱动因素。特别是,有效的维度使我们能够在最初的顺序上维持原始问题的结构,同时将其嵌入更大的空间,从而自然地导致类似未经审查的环境的结果。我们的分析涉及持续地普遍采用我们所认为具有独立兴趣的 Elliptical 潜在不平等。我们还发现了在审查下决策的有趣属性:在最初的错误区分审查以额外的成本自我纠正的过渡阶段,随后是反映有效维度所制约的内在学习减速的静止阶段。我们的结果有助于应用顺序决策模式,因为收到的反馈取决于战略不确定性(例如代理人对信息迟误)和随机不确定性(即信息迟误)以及(即信息迟误)的不确定性)。

0

相关内容

赌博机/老虎机

赌博机/老虎机

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

431+阅读 · 2021年1月11日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

与死亡结构域蛋白TRADD\FADD\RIP1互作的牛分枝杆菌蛋白的鉴定和功能研究

国家自然科学基金

0+阅读 · 2015年12月31日

20克级的水溶性Mn-Cu-In-S磁/光双功能量子点的制备

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤细胞中坏死基因Rip3的表达调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-VNN3和OBFC2A在苯血液毒性中的功能及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非线性偏微分方程的非线性微分约束

国家自然科学基金

1+阅读 · 2013年12月31日

解离型PPCPs在壳聚糖荷电纳滤膜中的去除机制

国家自然科学基金

0+阅读 · 2012年12月31日

炭疽杆菌S-层蛋白BA3338功能研究

国家自然科学基金

0+阅读 · 2010年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

Expert-Independent Generalization of Well and Seismic Data Using Machine Learning Methods for Complex Reservoirs Predicting During Early-Stage Geological Exploration

Arxiv

0+阅读 · 2023年4月6日

A Visual Active Search Framework for Geospatial Exploration

Arxiv

0+阅读 · 2023年4月5日

Learn to Grasp via Intention Discovery and its Application to Challenging Clutter

Arxiv

0+阅读 · 2023年4月5日

On Complexity of 1-Center in Various Metrics

Arxiv

0+阅读 · 2023年4月4日

Grid-SD2E: A General Grid-Feedback in a System for Cognitive Learning

Arxiv

0+阅读 · 2023年4月4日

Infinite-dimensional integration and $L^2$-approximation on Hermite spaces

Arxiv

0+阅读 · 2023年4月4日

QUICstep: Circumventing QUIC-based Censorship

Arxiv

0+阅读 · 2023年4月3日

Online Algorithms for Hierarchical Inference in Deep Learning applications at the Edge

Arxiv

0+阅读 · 2023年4月3日

Demonstration of InsightPilot: An LLM-Empowered Automated Data Exploration System

Arxiv

0+阅读 · 2023年4月2日

Matrix Decomposition and Applications

Arxiv

54+阅读 · 2022年1月1日

VIP会员

文章信息

相关主题

赌博机/老虎机

上下文赌博机/上下文老虎机

相关VIP内容

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

不可错过！700+ppt《因果推理》课程！杜克大学Fan Li教程

专知会员服务

72+阅读 · 2022年7月11日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

MIT经典《线性代数》，584页pdf，Introduction to Linear Algebra, Fifth Edition, Gilbert Strang, 2016.

专知会员服务

431+阅读 · 2021年1月11日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Expert-Independent Generalization of Well and Seismic Data Using Machine Learning Methods for Complex Reservoirs Predicting During Early-Stage Geological Exploration

Arxiv

0+阅读 · 2023年4月6日

A Visual Active Search Framework for Geospatial Exploration

Arxiv

0+阅读 · 2023年4月5日

Learn to Grasp via Intention Discovery and its Application to Challenging Clutter

Arxiv

0+阅读 · 2023年4月5日

On Complexity of 1-Center in Various Metrics

Arxiv

0+阅读 · 2023年4月4日

Grid-SD2E: A General Grid-Feedback in a System for Cognitive Learning

Arxiv

0+阅读 · 2023年4月4日

Infinite-dimensional integration and $L^2$-approximation on Hermite spaces

Arxiv

0+阅读 · 2023年4月4日

QUICstep: Circumventing QUIC-based Censorship

Arxiv

0+阅读 · 2023年4月3日

Online Algorithms for Hierarchical Inference in Deep Learning applications at the Edge

Arxiv

0+阅读 · 2023年4月3日

Demonstration of InsightPilot: An LLM-Empowered Automated Data Exploration System

Arxiv

0+阅读 · 2023年4月2日

Matrix Decomposition and Applications

Arxiv

54+阅读 · 2022年1月1日

相关基金

与死亡结构域蛋白TRADD\FADD\RIP1互作的牛分枝杆菌蛋白的鉴定和功能研究

国家自然科学基金

0+阅读 · 2015年12月31日

20克级的水溶性Mn-Cu-In-S磁/光双功能量子点的制备

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤细胞中坏死基因Rip3的表达调控机制

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA-VNN3和OBFC2A在苯血液毒性中的功能及机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

非线性偏微分方程的非线性微分约束

国家自然科学基金

1+阅读 · 2013年12月31日

解离型PPCPs在壳聚糖荷电纳滤膜中的去除机制

国家自然科学基金

0+阅读 · 2012年12月31日

炭疽杆菌S-层蛋白BA3338功能研究

国家自然科学基金

0+阅读 · 2010年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员