Real-time applications of energy management strategies (EMSs) in hybrid electric vehicles (HEVs) are the harshest requirements for researchers and engineers. Inspired by the excellent problem-solving capabilities of deep reinforcement learning (DRL), this paper proposes a real-time EMS via incorporating the DRL method and transfer learning (TL). The related EMSs are derived from and evaluated on the real-world collected driving cycle dataset from Transportation Secure Data Center (TSDC). The concrete DRL algorithm is proximal policy optimization (PPO) belonging to the policy gradient (PG) techniques. For specification, many source driving cycles are utilized for training the parameters of deep network based on PPO. The learned parameters are transformed into the target driving cycles under the TL framework. The EMSs related to the target driving cycles are estimated and compared in different training conditions. Simulation results indicate that the presented transfer DRL-based EMS could effectively reduce time consumption and guarantee control performance.
翻译:在混合电动车辆(HEV)中实时应用能源管理战略(EMS)是研究人员和工程师最苛刻的要求。在深度强化学习(DRL)的出色解决问题能力激励下,本文件建议通过采用DRL方法和转移学习(TL)来实时实施EMS。相关的EMS来自从运输安全数据中心(TSDC)收集的真实世界驾驶周期数据集,并根据这些数据进行评估。具体的DRL算法是属于政策梯度(PG)技术的近似政策优化(PPPPO)。关于规格,许多源驱动周期被用于培训基于PPPO的深层网络参数。所学的参数被转换为TL框架下的目标驱动周期。与目标驱动周期有关的EMS在不同的培训条件下进行估计和比较。模拟结果表明,基于DRL EMS的转移可有效减少时间消耗和保证控制性能。