Modeling the preferences of agents over a set of alternatives is a principal concern in many areas. The dominant approach has been to find a single reward/utility function with the property that alternatives yielding higher rewards are preferred over alternatives yielding lower rewards. However, in many settings, preferences are based on multiple, often competing, objectives; a single reward function is not adequate to represent such preferences. This paper proposes a method for inferring multi-objective, reward-based representations of an agent's observed preferences. We model the agent's priorities over the different objectives as entering lexicographically, so that objectives with lower priorities matter only when the agent is indifferent with respect to objectives with higher priorities. We offer two example applications in healthcare, one inspired by cancer treatment and the other by organ transplantation, to illustrate how the lexicographically ordered rewards we learn can provide a better understanding of a decision-maker's preferences and help improve policies when used in reinforcement learning.
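For concreteness, a standard formalization of such lexicographic preferences reads as follows (the notation is illustrative and not necessarily the paper's own): given per-objective reward functions $r_1, \dots, r_K$ listed in decreasing order of priority, alternative $a$ is strictly preferred to alternative $b$, written $a \succ b$, when
$$
a \succ b \iff \exists\, k \in \{1, \dots, K\} \ \text{s.t.}\ r_k(a) > r_k(b) \ \text{and}\ r_j(a) = r_j(b) \ \text{for all}\ j < k .
$$
That is, a lower-priority objective $r_k$ breaks ties only when all higher-priority objectives $r_1, \dots, r_{k-1}$ leave the agent indifferent between the two alternatives.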