MIT新书《强化学习与最优控制》,REINFORCEMENT LEARNING AND OPTIMAL CONTROL https://web.mit.edu/dimitrib/www/Slides_Lecture13_RLOC.pdf https://web.mit.edu/dimitrib/www/RLbook.html