This article is a gentle discussion about the field of reinforcement learning for real life, about opportunities and challenges, with perspectives and without technical details, touching a broad range of topics. The article is based on both historical and recent research papers, surveys, tutorials, talks, blogs, and books. Various groups of readers, like researchers, engineers, students, managers, investors, officers, and people wanting to know more about the field, may find the article interesting. In this article, we first give a brief introduction to reinforcement learning (RL), and its relationship with deep learning, machine learning and AI. Then we discuss opportunities of RL, in particular, applications in products and services, games, recommender systems, robotics, transportation, economics and finance, healthcare, education, combinatorial optimization, computer systems, and science and engineering. The we discuss challenges, in particular, 1) foundation, 2) representation, 3) reward, 4) model, simulation, planning, and benchmarks, 5) learning to learn a.k.a. meta-learning, 6) off-policy/offline learning, 7) software development and deployment, 8) business perspectives, and 9) more challenges. We conclude with a discussion, attempting to answer: "Why has RL not been widely adopted in practice yet?" and "When is RL helpful?".
翻译:文章以历史和最近的研究论文、调查、辅导、演讲、博客和书籍为基础。各种读者群体,如研究人员、工程师、学生、管理人员、投资者、官员和希望了解更多实地情况的人,可能会发现这篇文章有趣。在本篇文章中,我们首先简要介绍强化学习(RL)及其与深层学习、机器学习和AI的关系。然后我们讨论学习机会,特别是产品和服务、游戏、推荐系统、机器人、运输、经济和金融、保健、教育、组合优化、计算机系统、科学和工程等方面的应用。我们讨论了挑战,特别是:(1)基础,(2)代表性,(3)奖励,(4)模型,模拟,规划和基准,(5)学习a.k.a.元学习,(6)非政策/脱线学习,(7)软件开发和部署,8)商业观点和9。我们后来广泛讨论了挑战,试图回答,但最后是,“我们试图回答,现在还是尝试了。