Autonomous Driving (AD), the area of robotics with the greatest potential impact on society, has gained a lot of momentum in the last decade. As a result of this, the number of datasets in AD has increased rapidly. Creators and users of datasets can benefit from a better understanding of developments in the field. While scientometric analysis has been conducted in other fields, it rarely revolves around datasets. Thus, the impact, attention, and influence of datasets on autonomous driving remains a rarely investigated field. In this work, we provide a scientometric analysis for over 200 datasets in AD. We perform a rigorous evaluation of relations between available metadata and citation counts based on linear regression. Subsequently, we propose an Influence Score to assess a dataset already early on without the need for a track-record of citations, which is only available with a certain delay.
翻译:自动驾驶(AD)是机器人领域对社会影响最大的领域,并在过去十年里获得了广泛的关注。由此,AD数据集的数量迅速增加。数据集的创造者和用户可以从更好地了解该领域的发展中获益。虽然科学计量分析已在其他领域进行过,但很少涉及数据集。因此,数据集对自动驾驶的影响,关注度和影响力仍然是一个很少研究的领域。在这项工作中,我们对AD中的200多个数据集进行了科学计量分析。我们基于线性回归对可用元数据和引用计数之间的关系进行了严格评估。随后,我们提出了一个影响力评分,以在没有引用计数记录的情况下早期评估数据集。