For the purpose of inspecting power plants, autonomous robots can be built using reinforcement learning techniques. The method replicates the environment and employs a simple reinforcement learning (RL) algorithm. This strategy might be applied in several sectors, including the electricity generation sector. A pre-trained model with perception, planning, and action is suggested by the research. To address optimization problems, such as the Unmanned Aerial Vehicle (UAV) navigation problem, Deep Q-network (DQN), a reinforcement learning-based framework that Deepmind launched in 2015, incorporates both deep learning and Q-learning. To overcome problems with current procedures, the research proposes a power plant inspection system incorporating UAV autonomous navigation and DQN reinforcement learning. These training processes set reward functions with reference to states and consider both internal and external effect factors, which distinguishes them from other reinforcement learning training techniques now in use. The key components of the reinforcement learning segment of the technique, for instance, introduce states such as the simulation of a wind field, the battery charge level of an unmanned aerial vehicle, the height the UAV reached, etc. The trained model makes it more likely that the inspection strategy will be applied in practice by enabling the UAV to move around on its own in difficult environments. The average score of the model converges to 9,000. The trained model allowed the UAV to make the fewest number of rotations necessary to go to the target point.
翻译:为了检查发电厂,可以使用强化学习技术建造自主机器人。该方法复制环境,并采用简单的强化学习算法。这一战略可以适用于几个部门,包括发电部门。研究建议了一种预培训模式,具有感知、规划和行动。为了解决优化问题,如无人驾驶飞行器导航问题、深Q网络(DQN),这是2015年启动的深明号发射的强化学习框架,既包括深层学习,也包括Q学习。为了克服当前程序的问题,研究建议了一个电动厂检查系统,包括UAV自主导航和DQN强化学习。这些培训程序设定了奖励功能,并同时考虑了内部和外部影响因素,将这些因素与目前使用的其他强化学习培训技术区分开来。例如,该技术的强化学习部分的关键组成部分引入了诸如模拟风场、无人驾驶航空飞行器的电池充电量、达到的高度等状态。经过培训的模型更可能使自己有必要的检查战略在参照国家自主自动飞行器自主导航和DQN强化学习的同时,同时考虑内外影响因素。这些培训过程的奖励功能与目前使用的其他强化学习训练训练训练技术技术的学习技术。例如模拟风场模拟、无人驾驶飞行器的电池的电池的电池电压水平、高度等等。</s>