PPPScenes:3D中虫害行动预测的新数据集和基线 (PePScenes: A Novel Dataset and Baseline for Pedestrian Action Prediction in 3D)

Predicting the behavior of road users, particularly pedestrians, is vital for safe motion planning in the context of autonomous driving systems. Traditionally, pedestrian behavior prediction has been realized in terms of forecasting future trajectories. However, recent evidence suggests that predicting higher-level actions, such as crossing the road, can help improve trajectory forecasting and planning tasks accordingly. There are a number of existing datasets that cater to the development of pedestrian action prediction algorithms, however, they lack certain characteristics, such as bird's eye view semantic map information, 3D locations of objects in the scene, etc., which are crucial in the autonomous driving context. To this end, we propose a new pedestrian action prediction dataset created by adding per-frame 2D/3D bounding box and behavioral annotations to the popular autonomous driving dataset, nuScenes. In addition, we propose a hybrid neural network architecture that incorporates various data modalities for predicting pedestrian crossing action. By evaluating our model on the newly proposed dataset, the contribution of different data modalities to the prediction task is revealed. The dataset is available at https://github.com/huawei-noah/PePScenes.

翻译：预测道路使用者,特别是行人的行为,对于自主驾驶系统的安全运动规划至关重要。传统上,行人行为预测是在预测未来轨迹方面实现的。然而,最近的证据表明,预测更高层次的行动,如跨过道路,有助于相应改进轨迹预测和规划任务。现有一些数据集有利于发展行人行动预测算法,但它们缺乏某些特征,如鸟眼眼视语义地图信息、现场物体的3D位置等,这对自主驾驶至关重要。为此,我们提议建立一个新的行人行动预测数据集,通过在流行的自主驾驶数据集(nuScenes)中添加一个 Perframe 2D/3D 边框和行为说明。此外,我们提议建立一个混合神经网络架构,纳入预测行人过境行动的各种数据模式。通过对新提议的数据集的模型进行评估,可以揭示不同数据模式对预测任务的贡献。数据集可在 https://github.com/hua-noah/PEScen查阅 https://github.com/wai-pes/PESen.

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【经典书】线性代数，Linear Algebra，525页pdf

专知会员服务

79+阅读 · 2021年1月29日

【CVPR2020】语义增强的场景文本识别的编码-解码器框架，SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

专知会员服务

25+阅读 · 2020年5月22日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

【DeepMind】基于变换的大规模数据对抗视频预测，Transformation-based Adversarial Video Prediction on Large-Scale Data

专知会员服务

17+阅读 · 2020年3月9日