Traffic simulation plays a crucial role in evaluating and improving autonomous driving planning systems. After being deployed on public roads, autonomous vehicles need to interact with human road participants with different social preferences (e.g., selfish or courteous human drivers). To ensure that autonomous vehicles take safe and efficient maneuvers in different interactive traffic scenarios, we should be able to evaluate autonomous vehicles against reactive agents with different social characteristics in the simulation environment. We propose a socially-controllable behavior generation (SCBG) model for this purpose, which allows the users to specify the level of courtesy of the generated trajectory while ensuring realistic and human-like trajectory generation through learning from real-world driving data. Specifically, we define a novel and differentiable measure to quantify the level of courtesy of driving behavior, leveraging marginal and conditional behavior prediction models trained from real-world driving data. The proposed courtesy measure allows us to auto-label the courtesy levels of trajectories from real-world driving data and conveniently train an SCBG model generating trajectories based on the input courtesy values. We examined the SCBG model on the Waymo Open Motion Dataset (WOMD) and showed that we were able to control the SCBG model to generate realistic driving behaviors with desired courtesy levels. Interestingly, we found that the SCBG model was able to identify different motion patterns of courteous behaviors according to the scenarios.
翻译:交通仿真在评估和改进自主驾驶规划系统方面发挥着至关重要的作用。在部署在公共道路上之后,自主驾驶汽车需要与具有不同社交需求(例如,自私或彬彬有礼的人类驾驶员)的人类路上参与者进行交互。为了确保自主驾驶汽车在不同的交互式交通场景中进行安全和高效的操作,我们应该能够在仿真环境中评估自主驾驶汽车针对具有不同社交特征的反应型代理。为此,我们提出了可社交控制行为生成模型(SCBG),该模型允许用户指定所生成轨迹的礼貌程度,并通过从真实驾驶数据中学习确保实现逼真且类似于人类的轨迹生成。具体而言,我们定义了一种新颖且可微分的测量方法来量化驾驶行为的礼貌程度,利用从真实驾驶数据中训练出的边际和条件行为预测模型。所提出的礼貌程度测量方法允许我们自动标记来自真实世界驾驶数据的轨迹的礼貌度级别,并通过输入礼貌值来方便地训练生成轨迹的SCBG模型。我们在Waymo Open Motion数据集(WOMD)上对SCBG模型进行了测试,并显示我们能够控制SCBG模型以生成具有预期礼貌级别的逼真驾驶行为。有趣的是,我们发现SCBG模型能够根据场景识别礼貌行为的不同运动模式。