The high emission and low energy efficiency caused by internal combustion engines (ICE) have become unacceptable under environmental regulations and the energy crisis. As a promising alternative solution, multi-power source electric vehicles (MPS-EVs) introduce different clean energy systems to improve powertrain efficiency. The energy management strategy (EMS) is a critical technology for MPS-EVs to maximize efficiency, fuel economy, and range. Reinforcement learning (RL) has become an effective methodology for the development of EMS. RL has received continuous attention and research, but there is still a lack of systematic analysis of the design elements of RL-based EMS. To this end, this paper presents an in-depth analysis of the current research on RL-based EMS (RL-EMS) and summarizes the design elements of RL-based EMS. This paper first summarizes the previous applications of RL in EMS from five aspects: algorithm, perception scheme, decision scheme, reward function, and innovative training method. The contribution of advanced algorithms to the training effect is shown, the perception and control schemes in the literature are analyzed in detail, different reward function settings are classified, and innovative training methods with their roles are elaborated. Finally, by comparing the development routes of RL and RL-EMS, this paper identifies the gap between advanced RL solutions and existing RL-EMS. Finally, this paper suggests potential development directions for implementing advanced artificial intelligence (AI) solutions in EMS.
翻译:根据环境条例和能源危机,内部燃烧引擎(ICE)造成的高排放和低能效已成为不可接受的。作为一种有希望的替代解决办法,多动力源电动车辆(MPS-EVs)引入了不同的清洁能源系统,以提高电力培训效率。能源管理战略(EMS)是MPS-EV的一项关键技术,可以最大限度地提高效率、燃料经济和范围。强化学习(RL)已成为发展EMS的有效方法。RL不断受到关注和研究,但仍缺乏对基于RL的EM的设计要素的系统分析。为此,本文深入分析了目前对基于RL的EMS(RL-EMS)的研究,总结了基于RL的EMS的设计要素。本文首先总结了RL以前在EMS的五个方面的应用:算法、观念计划、决策计划、奖赏功能和创新培训方法。先进算法对培训效果的贡献,文献中的认知和控制方案进行了详细分析,不同的RMS(R-EM)奖赏功能环境对目前基于RL的研究进行了深入分析,最后将R-RMS的先进发展路径和创新培训方法与现有设计方法加以比较。