Training reinforcement learning agents at solving a given task is highly dependent on identifying optimal sets of hyperparameters and selecting suitable environment input / output configurations. This tedious process could be eased with a straightforward toolbox allowing its user to quickly compare different training parameter sets. We present rl_reach, a self-contained, open-source and easy-to-use software package designed to run reproducible reinforcement learning experiments for customisable robotic reaching tasks. rl_reach packs together training environments, agents, hyperparameter optimisation tools and policy evaluation scripts, allowing its users to quickly investigate and identify optimal training configurations. rl_reach is publicly available at this URL: https://github.com/PierreExeter/rl_reach.
翻译:强化培训学习机构在解决特定任务时,高度依赖确定最佳的超参数和选择合适的环境输入/输出配置。这一无聊的过程可以通过一个直接的工具箱来缓解,使用户能够快速比较不同的培训参数组。我们展示了rl_reach,这是一个自成一体、开放源码和易于使用的软件包,旨在为定制的机器人达标任务进行可复制的强化学习实验。rl_referve 包将培训环境、代理、超光谱优化工具和政策评价脚本结合起来,使用户能够迅速调查和确定最佳培训配置。 rl_referation可在以下网址上公开查阅:https://github.com/PierreExeter/rl_reach。