【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习 - 专知

会员服务 ·

0

【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习

2021 年 2 月 21 日 专知

强化学习是一种学习范式，它关注于如何学习控制一个系统，从而最大化表达一个长期目标的数值性能度量。强化学习与监督学习的区别在于，对于学习者的预测，只向学习者提供部分反馈。此外，预测还可能通过影响被控系统的未来状态而产生长期影响。因此，时间起着特殊的作用。强化学习的目标是开发高效的学习算法，以及了解算法的优点和局限性。强化学习具有广泛的实际应用价值，从人工智能到运筹学或控制工程等领域。在这本书中，我们重点关注那些基于强大的动态规划理论的强化学习算法。我们给出了一个相当全面的学习问题目录，描述了核心思想，关注大量的最新算法，然后讨论了它们的理论性质和局限性。

Preface ix
Acknowledgments xiii
Markov Decision Processes 1

Preliminaries 1
Markov Decision Processes 1
Value functions 6
Dynamic programming algorithms for solving MDPs 10

Value Prediction Problems 11

TD(lambda) with function approximation 22
Gradient temporal difference learning 25
Least-squares methods 27
The choice of the function space 33
Tabular TD(0) 11
Every-visit Monte-Carlo 14
TD(lambda): Unifying Monte-Carlo and TD(0) 16

Temporal difference learning in finite state spaces 11
Algorithms for large state spaces 18

Control 37

Implementing a critic 54
Implementing an actor 56
Q-learning in finite MDPs 47
Q-learning with function approximation 49
Online learning in bandits 38
Active learning in bandits 40
Active learning in Markov Decision Processes 41
Online learning in Markov Decision Processes 42

A catalog of learning problems 37
Closed-loop interactive learning 38
Direct methods 47
Actor-critic methods 52

For Further Exploration 63

Further reading 63
Applications 63
Software 64

Appendix: The Theory of Discounted Markovian Decision Processes 65

A.1 Contractions and Banach’s fixed-point theorem 65
A.2 Application to MDPs 69

Bibliography 73
Author's Biography 89

https://sites.ualberta.ca/~szepesva/rlbook.html

专知便捷查看

便捷下载，请关注专知公众号（点击上方蓝色专知关注）

后台回复“A98” 可以获取《【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习》专知下载链接索引

专知，专业可信的人工智能知识分发，让认知协作更快更好！欢迎注册登录专知www.zhuanzhi.ai，获取5000+AI主题干货知识资料！

欢迎微信扫一扫加入专知人工智能知识星球群，获取最新AI专业干货知识教程资料和与专家交流咨询！

点击“ 阅读原文 ”，了解使用专知 ，查看获取5000+AI主题知识资源

登录查看更多

0

相关内容

强化学习算法

强化学习算法

【普林斯顿干货书】强化学习与随机优化，728页pdf阐述序列决策统一框架

【普林斯顿干货书】强化学习与随机优化，728页pdf阐述序列决策统一框架

专知会员服务

129+阅读 · 2021年4月25日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

【2021新书】模式、预测与行动：机器学习的故事，308页pdf

【2021新书】模式、预测与行动：机器学习的故事，308页pdf

专知会员服务

84+阅读 · 2021年2月15日

【MIT干货书】机器学习算法视角，126页pdf

【MIT干货书】机器学习算法视角，126页pdf

专知会员服务

78+阅读 · 2021年1月25日

最新《计算控制理论》笔记与课程，60页pdf

专知会员服务

54+阅读 · 2020年12月24日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【圣经书】《强化学习导论(2nd)》电子书与代码，548页pdf

【圣经书】《强化学习导论(2nd)》电子书与代码，548页pdf

专知会员服务

208+阅读 · 2020年5月22日

【经典书】机器学习高斯过程，266页pdf

【经典书】机器学习高斯过程，266页pdf

专知会员服务

200+阅读 · 2020年5月2日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

解读！清华、谷歌等10篇强化学习论文总结

解读！清华、谷歌等10篇强化学习论文总结

学术头条

7+阅读 · 2019年11月18日

67页PPT▍AI时代的机器学习算法、应用及数据处理（附下载）

67页PPT▍AI时代的机器学习算法、应用及数据处理（附下载）

36大数据

28+阅读 · 2019年4月15日

强化学习精品书籍

强化学习精品书籍

平均机器

26+阅读 · 2019年1月2日

【伯克利博士论文】如何让机器人多技能？通过最大熵强化学习(107页pdf)

【伯克利博士论文】如何让机器人多技能？通过最大熵强化学习(107页pdf)

专知

12+阅读 · 2018年12月22日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

【干货】ICML2018：63篇强化学习论文精华解读！

【干货】ICML2018：63篇强化学习论文精华解读！

新智元

7+阅读 · 2018年7月24日

【ICML2018】63篇强化学习论文全解读

【ICML2018】63篇强化学习论文全解读

专知

7+阅读 · 2018年7月24日

【资源】Python强化学习实战，Anaconda公司的高级数据科学家讲解（附相关Python开源库）

【资源】Python强化学习实战，Anaconda公司的高级数据科学家讲解（附相关Python开源库）

专知

13+阅读 · 2017年12月10日

机器学习(28)【降维】之sklearn中PCA库讲解与实战

机器学习(28)【降维】之sklearn中PCA库讲解与实战

机器学习算法与Python学习

8+阅读 · 2017年11月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning

Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning

Arxiv

6+阅读 · 2020年3月28日

A Comprehensive Comparison of Unsupervised Network Representation Learning Methods

Arxiv

5+阅读 · 2019年3月19日

Deep Reinforcement Learning: An Overview

Deep Reinforcement Learning: An Overview

Arxiv

17+阅读 · 2018年11月26日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

Deep Learning

Arxiv

6+阅读 · 2018年8月3日

A Restricted-Domain Dual Formulation for Two-Phase Image Segmentation

A Restricted-Domain Dual Formulation for Two-Phase Image Segmentation

Arxiv

3+阅读 · 2018年7月30日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

FuzzerGym: A Competitive Framework for Fuzzing and Learning

FuzzerGym: A Competitive Framework for Fuzzing and Learning

Arxiv

4+阅读 · 2018年7月19日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Regularized Singular Value Decomposition and Application to Recommender System

Arxiv

6+阅读 · 2018年4月13日

VIP会员

相关主题

强化学习算法

相关VIP内容

【普林斯顿干货书】强化学习与随机优化，728页pdf阐述序列决策统一框架

【普林斯顿干货书】强化学习与随机优化，728页pdf阐述序列决策统一框架

专知会员服务

129+阅读 · 2021年4月25日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

【2021新书】模式、预测与行动：机器学习的故事，308页pdf

【2021新书】模式、预测与行动：机器学习的故事，308页pdf

专知会员服务

84+阅读 · 2021年2月15日

【MIT干货书】机器学习算法视角，126页pdf

【MIT干货书】机器学习算法视角，126页pdf

专知会员服务

78+阅读 · 2021年1月25日

最新《计算控制理论》笔记与课程，60页pdf

专知会员服务

54+阅读 · 2020年12月24日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【圣经书】《强化学习导论(2nd)》电子书与代码，548页pdf

【圣经书】《强化学习导论(2nd)》电子书与代码，548页pdf

专知会员服务

208+阅读 · 2020年5月22日

【经典书】机器学习高斯过程，266页pdf

【经典书】机器学习高斯过程，266页pdf

专知会员服务

200+阅读 · 2020年5月2日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

解读！清华、谷歌等10篇强化学习论文总结

解读！清华、谷歌等10篇强化学习论文总结

学术头条

7+阅读 · 2019年11月18日

67页PPT▍AI时代的机器学习算法、应用及数据处理（附下载）

67页PPT▍AI时代的机器学习算法、应用及数据处理（附下载）

36大数据

28+阅读 · 2019年4月15日

强化学习精品书籍

强化学习精品书籍

平均机器

26+阅读 · 2019年1月2日

【伯克利博士论文】如何让机器人多技能？通过最大熵强化学习(107页pdf)

【伯克利博士论文】如何让机器人多技能？通过最大熵强化学习(107页pdf)

专知

12+阅读 · 2018年12月22日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

【干货】ICML2018：63篇强化学习论文精华解读！

【干货】ICML2018：63篇强化学习论文精华解读！

新智元

7+阅读 · 2018年7月24日

【ICML2018】63篇强化学习论文全解读

【ICML2018】63篇强化学习论文全解读

专知

7+阅读 · 2018年7月24日

【资源】Python强化学习实战，Anaconda公司的高级数据科学家讲解（附相关Python开源库）

【资源】Python强化学习实战，Anaconda公司的高级数据科学家讲解（附相关Python开源库）

专知

13+阅读 · 2017年12月10日

机器学习(28)【降维】之sklearn中PCA库讲解与实战

机器学习(28)【降维】之sklearn中PCA库讲解与实战

机器学习算法与Python学习

8+阅读 · 2017年11月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning

Mining Implicit Entity Preference from User-Item Interaction Data for Knowledge Graph Completion via Adversarial Learning

Arxiv

6+阅读 · 2020年3月28日

A Comprehensive Comparison of Unsupervised Network Representation Learning Methods

Arxiv

5+阅读 · 2019年3月19日

Deep Reinforcement Learning: An Overview

Deep Reinforcement Learning: An Overview

Arxiv

17+阅读 · 2018年11月26日

Causal Embeddings for Recommendation

Arxiv

23+阅读 · 2018年8月3日

Deep Learning

Arxiv

6+阅读 · 2018年8月3日

A Restricted-Domain Dual Formulation for Two-Phase Image Segmentation

A Restricted-Domain Dual Formulation for Two-Phase Image Segmentation

Arxiv

3+阅读 · 2018年7月30日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

FuzzerGym: A Competitive Framework for Fuzzing and Learning

FuzzerGym: A Competitive Framework for Fuzzing and Learning

Arxiv

4+阅读 · 2018年7月19日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Regularized Singular Value Decomposition and Application to Recommender System

Arxiv

6+阅读 · 2018年4月13日

大家都在搜

CMU博士论文

无人机集群

软件无线电

国防科技创新

OpenKG开源系列 | 海洋鱼类百科知识图谱（浙江大学）

微信扫码咨询专知VIP会员