感觉乐观吗? 在线决策的模糊态度</s> (Feeling Optimistic? Ambiguity Attitudes for Online Decision Making) - 专知论文

会员服务 ·

0

INFORMS · 稳健性 · Feel · 回合 · Agent ·

2023 年 3 月 7 日

Feeling Optimistic? Ambiguity Attitudes for Online Decision Making

翻译：感觉乐观吗? 在线决策的模糊态度

Jared J. Beard,R. Michael Butts,Yu Gu

from arxiv, 8 pages, 9 figures, 2 algorithms. Submitted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems in Detroit, Michigan USA (Oct 1-5, 2023)

As autonomous agents enter complex environments, it becomes more difficult to adequately model the interactions between the two. Agents must therefore cope with greater ambiguity (e.g., unknown environments, underdefined models, and vague problem definitions). Despite the consequences of ignoring ambiguity, tools for decision making under ambiguity are understudied. The general approach has been to avoid ambiguity (exploit known information) using robust methods. This work contributes ambiguity attitude graph search (AAGS), generalizing robust methods with ambiguity attitudes--the ability to trade-off between seeking and avoiding ambiguity in the problem. AAGS solves online decision making problems with limited budget to learn about their environment. To evaluate this approach AAGS is tasked with path planning in static and dynamic environments. Results demonstrate that appropriate ambiguity attitudes are dependent on the quality of information from the environment. In relatively certain environments, AAGS can readily exploit information with robust policies. Conversely, model complexity reduces the information conveyed by individual samples; this allows the risks taken by optimistic policies to achieve better performance.

翻译：随着自主代理商进入复杂的环境,就更难充分模拟两者之间的相互作用。因此,代理商必须应对更大的模糊性(例如,未知的环境、定义不足的模式和模糊的问题定义)。尽管忽视模糊性的后果,但在模糊性的决策工具方面研究不足。一般的做法是使用稳健的方法避免模糊性(开发已知信息),这项工作有助于模糊性态度图搜索(AGS),推广稳健方法,在寻求和避免问题之间取舍的模棱两可性。AGS解决了在线决策问题,预算有限,了解环境。评估AAGS的任务是在静态和动态环境中进行路径规划。结果表明,适当的模糊性态度取决于环境信息的质量。在相对特定的环境中,AGS可以随时以稳健的政策利用信息。相反,模型的复杂性会减少单个样本提供的信息;这就使得乐观政策所冒的风险,从而取得更好的业绩。</s>

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

内质网Ca2+感受器STIM1调控糖尿病冠状动脉平滑肌细胞表型转化的机制

国家自然科学基金

0+阅读 · 2014年12月31日

胃蛋白酶在喉咽上皮细胞炎症恶性转化中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

组团参加国际光学联合会大会

国家自然科学基金

0+阅读 · 2012年8月18日

吡唑基铜配合物的合成与组装

国家自然科学基金

0+阅读 · 2011年12月31日

新型酰胺衍生物合成与抑菌活性研究

国家自然科学基金

0+阅读 · 2009年12月31日

DNA损伤诱导的p53非依赖性细胞凋亡途径- - -Bim途径

国家自然科学基金

0+阅读 · 2009年12月31日

肾康丸对糖尿病肾病大鼠miR-192介导通路的影响

国家自然科学基金

1+阅读 · 2009年12月31日

Mather理论与Hamilton-Jacobi方程的粘性解

国家自然科学基金

0+阅读 · 2009年12月31日

Ga、Al、In氮化物及其合金和径向异质结纳米线的可控制备和物性研究

国家自然科学基金

0+阅读 · 2008年12月31日

The Power of Typed Affine Decision Structures: A Case Study

Arxiv

0+阅读 · 2023年4月28日

Client Recruitment for Federated Learning in ICU Length of Stay Prediction

Arxiv

0+阅读 · 2023年4月28日

Dynamic Pricing and Learning with Bayesian Persuasion

Arxiv

0+阅读 · 2023年4月27日

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Arxiv

0+阅读 · 2023年4月27日

Level Assembly as a Markov Decision Process

Arxiv

0+阅读 · 2023年4月27日

Data-driven Piecewise Affine Decision Rules for Stochastic Programming with Covariate Information

Arxiv

0+阅读 · 2023年4月26日

Positive Difference Distribution for Image Outlier Detection using Normalizing Flows and Contrastive Data

Arxiv

0+阅读 · 2023年4月26日

The Update Equivalence Framework for Decision-Time Planning

Arxiv

0+阅读 · 2023年4月25日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

101+阅读 · 2022年5月11日

Generalized Out-of-Distribution Detection: A Survey

Generalized Out-of-Distribution Detection: A Survey

Arxiv

15+阅读 · 2021年10月21日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型时代的文档智能：综述

蜂窝通信是否是无人机与无人地面战车主宰战场的关键？

文档视觉问答简述

最新新Agent综述！76页327篇论文梳理，北交大桑基韬教授团队发布《迈向模型原生智能体式人工智能的范式转变综述》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

The Power of Typed Affine Decision Structures: A Case Study

Arxiv

0+阅读 · 2023年4月28日

Client Recruitment for Federated Learning in ICU Length of Stay Prediction

Arxiv

0+阅读 · 2023年4月28日

Dynamic Pricing and Learning with Bayesian Persuasion

Arxiv

0+阅读 · 2023年4月27日

A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints

Arxiv

0+阅读 · 2023年4月27日

Level Assembly as a Markov Decision Process

Arxiv

0+阅读 · 2023年4月27日

Data-driven Piecewise Affine Decision Rules for Stochastic Programming with Covariate Information

Arxiv

0+阅读 · 2023年4月26日

Positive Difference Distribution for Image Outlier Detection using Normalizing Flows and Contrastive Data

Arxiv

0+阅读 · 2023年4月26日

The Update Equivalence Framework for Decision-Time Planning

Arxiv

0+阅读 · 2023年4月25日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

101+阅读 · 2022年5月11日

Generalized Out-of-Distribution Detection: A Survey

Generalized Out-of-Distribution Detection: A Survey

Arxiv

15+阅读 · 2021年10月21日

相关基金

内质网Ca2+感受器STIM1调控糖尿病冠状动脉平滑肌细胞表型转化的机制

国家自然科学基金

0+阅读 · 2014年12月31日

胃蛋白酶在喉咽上皮细胞炎症恶性转化中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

函数域中的Vinogradov中值定理

国家自然科学基金

0+阅读 · 2012年12月31日

组团参加国际光学联合会大会

国家自然科学基金

0+阅读 · 2012年8月18日

吡唑基铜配合物的合成与组装

国家自然科学基金

0+阅读 · 2011年12月31日

新型酰胺衍生物合成与抑菌活性研究

国家自然科学基金

0+阅读 · 2009年12月31日

DNA损伤诱导的p53非依赖性细胞凋亡途径- - -Bim途径

国家自然科学基金

0+阅读 · 2009年12月31日

肾康丸对糖尿病肾病大鼠miR-192介导通路的影响

国家自然科学基金

1+阅读 · 2009年12月31日

Mather理论与Hamilton-Jacobi方程的粘性解

国家自然科学基金

0+阅读 · 2009年12月31日

Ga、Al、In氮化物及其合金和径向异质结纳米线的可控制备和物性研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员