Simulation has the potential to massively scale evaluation of self-driving systems enabling rapid development as well as safe deployment. To close the gap between simulation and the real world, we need to simulate realistic multi-agent behaviors. Existing simulation environments rely on heuristic-based models that directly encode traffic rules, which cannot capture irregular maneuvers (e.g., nudging, U-turns) and complex interactions (e.g., yielding, merging). In contrast, we leverage real-world data to learn directly from human demonstration and thus capture a more diverse set of actor behaviors. To this end, we propose TrafficSim, a multi-agent behavior model for realistic traffic simulation. In particular, we leverage an implicit latent variable model to parameterize a joint actor policy that generates socially-consistent plans for all actors in the scene jointly. To learn a robust policy amenable for long horizon simulation, we unroll the policy in training and optimize through the fully differentiable simulation across time. Our learning objective incorporates both human demonstrations as well as common sense. We show TrafficSim generates significantly more realistic and diverse traffic scenarios as compared to a diverse set of baselines. Notably, we can exploit trajectories generated by TrafficSim as effective data augmentation for training better motion planner.
翻译:模拟有可能大规模地评估自我驾驶系统,从而能够快速发展和安全地部署。为了缩小模拟与现实世界之间的差距,我们需要模拟现实多剂行为。现有的模拟环境依赖于直接编码交通规则的基于疲劳的模型,这些模型无法捕捉非常规动作(如裸体、Uturns)和复杂的相互作用(如收成、合并)。相比之下,我们利用真实世界数据直接从人类演示中学习,从而捕捉出一套更加多样化的行为者行为。为此,我们提出TeleSim,这是一个用于现实交通模拟的多剂行为模型。特别是,我们利用隐含的潜在变异模型,将一个联合行为者政策参数化,为现场所有行为者共同制定社会一致的计划。要学习一种适合长视野模拟的强有力政策,我们通过完全不同的模拟,在培训和优化中引入政策。我们的学习目标既包括人类演示,又包括一套共同感官。我们展示StraSim能够大大地产生更加现实和多样化的交通假设情景,而比一个多样化的移动性模型能产生更好的稳定度基线。我们通过不同的数据来利用。