[Reinforcement Learning Workshop | Microsoft Research] Learning for Policy Improvement, Geoff Gordon, Professor at Carnegie Mellon University


Artificial Intelligence · Reinforcement Learning · Microsoft Research · Carnegie Mellon University · Geoff Gordon

October 3, 2019


Topic: Learning for policy improvement

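Abstract: Reinforcement learning has seen many successes in domains where experience is cheap to obtain, such as video games or board games. RL algorithms for such domains are often based on gradient descent: they make many noisy updates with a small learning rate. Instead, we look at algorithms that spend more computation per update, trying to reduce noise and make larger updates; such algorithms are appropriate when experience is more expensive than compute time. In particular, we look at several methods based on approximate policy iteration.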
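As a rough illustration of the policy iteration loop the abstract refers to, the sketch below alternates an approximate evaluation step with a greedy improvement step. It is not the speaker's method: the 4-state chain MDP, the reward of 1.0 for entering the last state, the discount of 0.9, and all function names are assumptions made for this example, and the "approximation" here is simply truncating policy evaluation to a fixed number of Bellman backup sweeps.

```python
from typing import List, Tuple

# Hypothetical toy problem (not from the talk): a 4-state chain with a goal at the right end.
N_STATES = 4
ACTIONS = (0, 1)  # 0 = move left, 1 = move right
GAMMA = 0.9       # assumed discount factor
GOAL = N_STATES - 1


def step(state: int, action: int) -> Tuple[int, float]:
    """Deterministic chain dynamics: reward 1.0 for entering the goal state."""
    if state == GOAL:
        return state, 0.0  # the goal is absorbing and yields no further reward
    nxt = max(0, state - 1) if action == 0 else state + 1
    return nxt, (1.0 if nxt == GOAL else 0.0)


def evaluate(policy: List[int], sweeps: int) -> List[float]:
    """Approximate policy evaluation: a fixed number of Bellman backup sweeps."""
    v = [0.0] * N_STATES
    for _ in range(sweeps):
        new_v = []
        for s in range(N_STATES):
            nxt, r = step(s, policy[s])
            new_v.append(r + GAMMA * v[nxt])
        v = new_v
    return v


def improve(v: List[float]) -> List[int]:
    """Policy improvement: act greedily with respect to the value estimate."""
    policy = []
    for s in range(N_STATES):
        def q(a: int, s: int = s) -> float:
            nxt, r = step(s, a)
            return r + GAMMA * v[nxt]
        policy.append(max(ACTIONS, key=q))  # ties resolve to the first action
    return policy


def policy_iteration(sweeps: int = 20, max_iters: int = 50) -> Tuple[List[int], List[float]]:
    """Alternate evaluation and improvement until the policy stops changing."""
    policy = [0] * N_STATES  # start from "always move left"
    v = evaluate(policy, sweeps)
    for _ in range(max_iters):
        new_policy = improve(v)
        if new_policy == policy:
            break
        policy = new_policy
        v = evaluate(policy, sweeps)
    return policy, v


if __name__ == "__main__":
    policy, values = policy_iteration()
    print("policy:", policy)
    print("values:", values)
```

Each improvement step here replaces the whole policy at once rather than nudging parameters by a small learning rate, which is the contrast with gradient-based updates drawn in the abstract. In this toy problem the loop settles on moving right in every non-goal state after a handful of iterations.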

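About the speaker: Dr. Geoff Gordon is Research Director of the Microsoft Research Montréal lab and a professor in the Machine Learning Department at Carnegie Mellon University. He has also served as Interim Department Head and as Associate Department Head for Education of the Machine Learning Department. His research focuses on artificial intelligence systems capable of long-term thinking, such as reasoning ahead to solve a problem, planning a sequence of actions, or inferring unseen properties from observations. In particular, he looks at how to combine machine learning with these long-term thinking tasks. Dr. Gordon received a bachelor's degree in computer science from Cornell University in 1991 and a Ph.D. in computer science from Carnegie Mellon University in 1999. His research interests include artificial intelligence, statistical machine learning, educational data, game theory, multi-robot systems, and planning in probabilistic, adversarial, and general-sum domains. His previous appointments include visiting professor in the Computer Science Department at Stanford University and principal scientist at Burning Glass Technologies in San Diego.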


