My project tackles the question of whether Ray can be used to quickly train autonomous vehicles using a simulator (Carla), and whether a platform robust enough for further research purposes can be built around it. Ray is an open-source framework that enables distributed machine learning applications. Distributed computing is a technique which parallelizes computational tasks, such as training a model, among many machines. Ray abstracts away the complex coordination of these machines, making it rapidly scalable. Carla is a vehicle simulator that generates data used to train a model. The bulk of the project was writing the training logic that Ray would use to train my distributed model. Imitation learning is the best fit for autonomous vehicles. Imitation learning is an alternative to reinforcement learning and it works by trying to learn the optimal policy by imitating an expert (usually a human) given a set of demonstrations. A key deliverable for the project was showcasing my trained agent in a few benchmark tests, such as navigating a complex turn through traffic. Beyond that, the broader ambition was to develop a research platform where others could quickly train and run experiments on huge amounts of Carla vehicle data. Thus, my end product is not a single model, but a large-scale, open-source research platform (RayCarla) for autonomous vehicle researchers to utilize.
翻译:我的项目解决了这样一个问题,即Ray是否可以用模拟器(Carla)快速培训自主车辆,以及能否在模拟器(Carla)周围建造一个足以用于进一步研究目的的平台。Ray是一个开放的源码框架,它可以使分散的机器学习应用。分布式计算是一种将计算任务(例如培训模型)与许多机器平行进行的技术。Ray总结了这些机器的复杂协调,使其迅速可扩展。Carla是一个生成用于培训模型的数据的车辆模拟器。该项目的大部分内容是编写Ray用来培训我分布式模型的培训逻辑。模仿学习是最适合自动车辆的。模仿学习是强化学习的另一种选择,它通过模仿专家(通常是人类)进行一系列演示来尝试学习最佳政策。Ray可以交付的钥匙是在一些基准测试中展示我受过训练的代理器,例如浏览一个复杂的交通转弯。此外,更广泛的雄心是开发一个研究平台,让其他人可以快速培训和进行大量卡拉汽车数据实验。因此,我最后的一位研究产品不是用于大型的自动分析平台。