Manually specifying features that capture the diversity in traffic environments is impractical. Consequently, learning-based agents cannot realize their full potential as neural motion planners for autonomous vehicles. Instead, this work proposes to learn which features are task-relevant. Given its immediate relevance to motion planning, our proposed architecture encodes the probabilistic occupancy map as a proxy for obtaining pre-trained state representations. By leveraging a map-aware graph formulation of the environment, our agent-centric encoder generalizes to arbitrary road networks and traffic situations. We show that our approach significantly improves the downstream performance of a reinforcement learning agent operating in urban traffic environments.
翻译:以手写方式说明交通环境的多样性是不切实际的。因此,以学习为基础的代理商无法充分发挥其作为自主车辆神经运动规划者的潜力。相反,这项工作提议了解哪些特征与任务相关。鉴于与行动规划直接相关,我们提议的架构将概率占用图编码为获得预先培训的国家代表的替代物。通过利用地图识别图绘制环境,我们以代理商为中心的编码器对任意的道路网络和交通情况进行了概括。我们表明,我们的方法大大改善了城市交通环境中强化学习代理商的下游业绩。</s>