While automated driving is often advertised with better-than-human driving performance, this work reviews that it is nearly impossible to provide direct statistical evidence on the system level that this is actually the case. The amount of labeled data needed would exceed dimensions of present day technical and economical capabilities. A commonly used strategy therefore is the use of redundancy along with the proof of sufficient subsystems' performances. As it is known, this strategy is efficient especially for the case of subsystems operating independently, i.e. the occurrence of errors is independent in a statistical sense. Here, we give some first considerations and experimental evidence that this strategy is not a free ride as the errors of neural networks fulfilling the same computer vision task, at least for some cases, show correlated occurrences of errors. This remains true, if training data, architecture, and training are kept separate or independence is trained using special loss functions. Using data from different sensors (realized by up to five 2D projections of the 3D MNIST data set) in our experiments is more efficiently reducing correlations, however not to an extent that is realizing the potential of reduction of testing data that can be obtained for redundant and statistically independent subsystems.
翻译:虽然自动化驾驶往往以非人驾驶的性能为广告做广告,但这项工作回顾,几乎不可能在系统一级提供直接的统计证据,说明情况确实如此。贴标签的数据数量将超过当今技术和经济能力的范围。因此,一个常用的战略是使用冗余和足够的子系统性能的证明。众所周知,这一战略对独立运作的子系统特别有效,即发生误差在统计意义上是独立的。这里,我们首先考虑和实验性证据表明,这一战略不是免费搭乘的,因为神经网络在完成相同的计算机视觉任务时有误差,至少在一些情况下,显示出相关的差错。如果培训数据、结构和培训是分开的,或者使用特殊损失功能进行独立培训,这仍然是正确的。在我们的实验中,使用不同传感器的数据(通过对三维MNIST数据集的高达5个2D的预测实现),可以更有效地减少关联性,但不能实现减少为冗余和统计上独立的子系统获得的测试数据的可能性。