Predicting future behaviors of road agents is a key task in autonomous driving. While existing models have demonstrated great success in predicting marginal agent future behaviors, it remains a challenge to efficiently predict consistent joint behaviors of multiple agents. Recently, the occupancy flow fields representation was proposed to represent joint future states of road agents through a combination of occupancy grid and flow, which supports efficient and consistent joint predictions. In this work, we propose a novel occupancy flow fields predictor to produce accurate occupancy and flow predictions, by combining the power of an image encoder that learns features from a rasterized traffic image and a vector encoder that captures information of continuous agent trajectories and map states. The two encoded features are fused by multiple attention modules before generating final predictions. Our simple but effective model ranks 3rd place on the Waymo Open Dataset Occupancy and Flow Prediction Challenge, and achieves the best performance in the occluded occupancy and flow prediction task.
翻译:预测道路代理商的未来行为是自主驾驶的一项关键任务。 虽然现有模型在预测边缘代理商未来行为方面表现出巨大成功,但有效预测多个代理商一致的共同行为仍是一项挑战。 最近,提议使用流动字段代表道路代理商的未来共同状态,办法是结合使用网和流量,支持高效和一致的联合预测。 在这项工作中,我们提议建立一个新的占用流动域预测器,以产生准确的占用和流量预测,方法是将一个图像编码器的力量结合起来,从分层交通图像中学习特征,以及一个矢量编码器,捕捉连续代理商轨迹和地图状态的信息。两个编码的功能在产生最终预测之前由多个关注模块结合。我们在Waymo Open Datasec Occup 和流程预测挑战中的简单而有效的模型排名第三,并实现隐蔽的占用和流量预测任务的最佳表现。