【强化学习研讨会|Microsoft Research】选择性噪声注入在强化学习应用，微软高级研究员Sam Devlin - 专知VIP

会员服务 ·

0

Microsoft Research · 强化学习 · Sam Devlin · 深度学习 · 3D游戏开发 ·

2019 年 10 月 3 日

【强化学习研讨会|Microsoft Research】选择性噪声注入在强化学习应用，微软高级研究员Sam Devlin

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

主题： Generalization in Reinforcement Learning with Selective Noise Injection

摘要： 强化学习是机器学习中唯一一种通常被允许在其测试集中进行训练的形式。特别是深度强化学习已被证明可以适应其所训练的环境。在本次演讲中，我将讨论我们最近两篇论文（1）显示域随机化在看不见的3D迷宫中导航的应用（在2019年IEEE游戏大会上发布）; （2）建议通过变化信息瓶颈进行选择性噪声注入，以将通用性提高到2D平台开发工具CoinRun的未知测试水平（NeurIPS 2019）。

嘉宾介绍： Sam Devlin，Microsoft Research高级研究员，于2009年获得约克大学计算机系统和软件工程硕士学位，其中包括一年与BAE Systems的团队合作。完成该学位后，从事传统的商业游戏AI的研究，将行为树和导航网格生成集成到开放源代码游戏引擎CrystalSpace中，作为2009年Google Summer of Code计划的一部分，2013年，完成了博士学位，在约克大学（University of York）进行多智能体强化学习，并访问了由桑坦德国际连接奖（Santander International Connections Award）资助的俄勒冈州立大学。

成为VIP会员查看完整内容

8

相关内容

Microsoft Research

Microsoft Research

Microsoft Research

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

专知会员服务

20+阅读 · 2020年4月14日

【技术报告】诺亚开源中文预训练语言模型“哪吒”（NEZHA: Neural Contextualized Representation for Chinese Language Understanding）

【技术报告】诺亚开源中文预训练语言模型“哪吒”（NEZHA: Neural Contextualized Representation for Chinese Language Understanding）

专知会员服务

21+阅读 · 2019年12月12日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

专知会员服务

26+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】减少强化学习的样本复杂性，171页pdf，多伦多大学|Sheila McIlraith

【强化学习研讨会|Microsoft Research】减少强化学习的样本复杂性，171页pdf，多伦多大学|Sheila McIlraith

专知会员服务

14+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】政策改进学习（Learning for policy improvement），卡内基梅隆大学教授| Geoff Gordon

【强化学习研讨会|Microsoft Research】政策改进学习（Learning for policy improvement），卡内基梅隆大学教授| Geoff Gordon

专知会员服务

13+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

【综述】多智能体深度强化学习综述，附49页PDF

专知会员服务

213+阅读 · 2019年8月30日

【微软Alekh等开放新书】强化学习理论与算法，83页pdf，了解最新进展

【微软Alekh等开放新书】强化学习理论与算法，83页pdf，了解最新进展

专知

25+阅读 · 2019年11月23日

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

专知

21+阅读 · 2019年11月14日

RL解决'LunarLander-v2' (SOTA)

RL解决'LunarLander-v2' (SOTA)

CreateAMind

62+阅读 · 2019年9月27日

谷歌更强 NLP 模型 XLNet 开源：20 项任务全面碾压 BERT！

谷歌更强 NLP 模型 XLNet 开源：20 项任务全面碾压 BERT！

雷锋网

5+阅读 · 2019年6月20日

开发 | 谷歌更强NLP模型XLNet开源：20项任务全面碾压BERT！

开发 | 谷歌更强NLP模型XLNet开源：20项任务全面碾压BERT！

AI科技评论

6+阅读 · 2019年6月20日

【ICML2019】微软智能对话方法教程，130页PPT带你了解最新研究进展

【ICML2019】微软智能对话方法教程，130页PPT带你了解最新研究进展

专知

15+阅读 · 2019年6月12日

【微软亚研130PPT教程】强化学习简介

【微软亚研130PPT教程】强化学习简介

专知

36+阅读 · 2018年10月26日

总览智能对话系统（3位微软与谷歌技术大牛联合出品）

总览智能对话系统（3位微软与谷歌技术大牛联合出品）

智能交通技术

8+阅读 · 2018年8月15日

微软研究院开源项目TextWorld：可用于强化学习训练的文本游戏

微软研究院开源项目TextWorld：可用于强化学习训练的文本游戏

专知

5+阅读 · 2018年8月11日

微软与谷歌研究员联合出品：175页 PPT 带你总览对话系统全貌

微软与谷歌研究员联合出品：175页 PPT 带你总览对话系统全貌

专知

9+阅读 · 2018年7月12日

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Next Item Recommendation with Self-Attention

Next Item Recommendation with Self-Attention

Arxiv

5+阅读 · 2018年8月25日

Self-Attention Recurrent Network for Saliency Detection

Self-Attention Recurrent Network for Saliency Detection

Arxiv

5+阅读 · 2018年8月5日

FuzzerGym: A Competitive Framework for Fuzzing and Learning

FuzzerGym: A Competitive Framework for Fuzzing and Learning

Arxiv

4+阅读 · 2018年7月19日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

3+阅读 · 2018年7月5日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

Arxiv

6+阅读 · 2018年4月18日

Self-Attention with Relative Position Representations

Arxiv

27+阅读 · 2018年4月12日

Stacked Cross Attention for Image-Text Matching

Arxiv

3+阅读 · 2018年3月21日

Natural Language Guided Visual Relationship Detection

Arxiv

3+阅读 · 2017年11月21日

VIP会员

相关主题

Microsoft Research

相关VIP内容

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

专知会员服务

20+阅读 · 2020年4月14日

【技术报告】诺亚开源中文预训练语言模型“哪吒”（NEZHA: Neural Contextualized Representation for Chinese Language Understanding）

【技术报告】诺亚开源中文预训练语言模型“哪吒”（NEZHA: Neural Contextualized Representation for Chinese Language Understanding）

专知会员服务

21+阅读 · 2019年12月12日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

【强化学习研讨会|Microsoft Research】多智能体强化学习 Scalable and Robust Multi-Agent Reinforcement Learning，46页pdf，美国东北大学|Christopher Amato

专知会员服务

26+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】减少强化学习的样本复杂性，171页pdf，多伦多大学|Sheila McIlraith

【强化学习研讨会|Microsoft Research】减少强化学习的样本复杂性，171页pdf，多伦多大学|Sheila McIlraith

专知会员服务

14+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】政策改进学习（Learning for policy improvement），卡内基梅隆大学教授| Geoff Gordon

【强化学习研讨会|Microsoft Research】政策改进学习（Learning for policy improvement），卡内基梅隆大学教授| Geoff Gordon

专知会员服务

13+阅读 · 2019年10月3日

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

【强化学习研讨会|Microsoft Research】安全公平的机器学习（Safe and Fair Machine Learning）

专知会员服务

16+阅读 · 2019年10月3日

【综述】多智能体深度强化学习综述，附49页PDF

专知会员服务

213+阅读 · 2019年8月30日

热门VIP内容

开通专知VIP会员享更多权益服务

《战区安全决策课程体系》最新244页

《"无人机航母"原型平台》

任务规划与地形分析：现代复杂环境作战导航体系

《攻击场景描述形式化模型研究》

相关资讯

【微软Alekh等开放新书】强化学习理论与算法，83页pdf，了解最新进展

【微软Alekh等开放新书】强化学习理论与算法，83页pdf，了解最新进展

专知

25+阅读 · 2019年11月23日

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

【加州理工】什么是模仿学习(Imitation Learning（模仿学习), 这62页ppt带你了解进展，附下载

专知

21+阅读 · 2019年11月14日

RL解决'LunarLander-v2' (SOTA)

RL解决'LunarLander-v2' (SOTA)

CreateAMind

62+阅读 · 2019年9月27日

谷歌更强 NLP 模型 XLNet 开源：20 项任务全面碾压 BERT！

谷歌更强 NLP 模型 XLNet 开源：20 项任务全面碾压 BERT！

雷锋网

5+阅读 · 2019年6月20日

开发 | 谷歌更强NLP模型XLNet开源：20项任务全面碾压BERT！

开发 | 谷歌更强NLP模型XLNet开源：20项任务全面碾压BERT！

AI科技评论

6+阅读 · 2019年6月20日

【ICML2019】微软智能对话方法教程，130页PPT带你了解最新研究进展

【ICML2019】微软智能对话方法教程，130页PPT带你了解最新研究进展

专知

15+阅读 · 2019年6月12日

【微软亚研130PPT教程】强化学习简介

【微软亚研130PPT教程】强化学习简介

专知

36+阅读 · 2018年10月26日

总览智能对话系统（3位微软与谷歌技术大牛联合出品）

总览智能对话系统（3位微软与谷歌技术大牛联合出品）

智能交通技术

8+阅读 · 2018年8月15日

微软研究院开源项目TextWorld：可用于强化学习训练的文本游戏

微软研究院开源项目TextWorld：可用于强化学习训练的文本游戏

专知

5+阅读 · 2018年8月11日

微软与谷歌研究员联合出品：175页 PPT 带你总览对话系统全貌

微软与谷歌研究员联合出品：175页 PPT 带你总览对话系统全貌

专知

9+阅读 · 2018年7月12日

相关论文

Pre-training Text Representations as Meta Learning

Arxiv

13+阅读 · 2020年4月12日

Next Item Recommendation with Self-Attention

Next Item Recommendation with Self-Attention

Arxiv

5+阅读 · 2018年8月25日

Self-Attention Recurrent Network for Saliency Detection

Self-Attention Recurrent Network for Saliency Detection

Arxiv

5+阅读 · 2018年8月5日

FuzzerGym: A Competitive Framework for Fuzzing and Learning

FuzzerGym: A Competitive Framework for Fuzzing and Learning

Arxiv

4+阅读 · 2018年7月19日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

3+阅读 · 2018年7月5日

Multiagent Soft Q-Learning

Arxiv

11+阅读 · 2018年4月25日

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

Arxiv

6+阅读 · 2018年4月18日

Self-Attention with Relative Position Representations

Arxiv

27+阅读 · 2018年4月12日

Stacked Cross Attention for Image-Text Matching

Arxiv

3+阅读 · 2018年3月21日

Natural Language Guided Visual Relationship Detection

Arxiv

3+阅读 · 2017年11月21日

微信扫码咨询专知VIP会员