imitation provides open-source implementations of imitation and reward learning algorithms in PyTorch. We include three inverse reinforcement learning (IRL) algorithms, three imitation learning algorithms and a preference comparison algorithm. The implementations have been benchmarked against previous results, and automated tests cover 98% of the code. Moreover, the algorithms are implemented in a modular fashion, making it simple to develop novel algorithms in the framework. Our source code, including documentation and examples, is available at https://github.com/HumanCompatibleAI/imitation
翻译:PyTorrch 的仿真和奖赏学习算法的开源实施提供了仿照和奖赏学习算法的开源实施。我们包括三种反向强化学习算法(IRL)算法、三种模仿学习算法和一种偏好比较算法。这些执行已根据以往的结果进行了基准,自动测试覆盖了代码的98%。此外,这些算法是以模块化方式实施的,使得在框架中开发新的算法变得简单易行。我们的源代码,包括文件和实例,可在https://github.com/HumanCompatibleAI/imitation查阅。