Amidst the COVID-19 pandemic, we organized a Reinforcement Learning (RL) course for a graduate school in the field of data science. We describe the strategy and materials for creating an exciting learning experience despite the ubiquitous Zoom fatigue and evaluate the course qualitatively. The key organizational features are a focus on a competitive, hands-on setting in teams, supported by a minimum of lectures providing the essential background on RL. The practical part of the course revolved around Hearts Gym, an RL environment for the card game Hearts that we developed as an entry-level tutorial to RL. Participants were tasked with training agents to explore reward shaping and other RL hyperparameters. For a final evaluation, the agents of the participants competed against each other.