In this work we address the task of observing the performance of a semantic segmentation deep neural network (DNN) during online operation, i.e., during inference, which is of high importance in safety-critical applications such as autonomous driving. Here, many high-level decisions rely on such DNNs, which are usually evaluated offline, while their performance in online operation remains unknown. To solve this problem, we propose an improved online performance prediction scheme, building on a recently proposed concept of predicting the primary semantic segmentation task's performance. This can be achieved by evaluating the auxiliary task of monocular depth estimation with a measurement supplied by a LiDAR sensor and a subsequent regression to the semantic segmentation performance. In particular, we propose (i) sequential training methods for both tasks in a multi-task training setup, (ii) to share the encoder as well as parts of the decoder between both task's networks for improved efficiency, and (iii) a temporal statistics aggregation method, which significantly reduces the performance prediction error at the cost of a small algorithmic latency. Evaluation on the KITTI dataset shows that all three aspects improve the performance prediction compared to previous approaches.
翻译:在这项工作中,我们处理的是在在线操作期间,即在推断期间,观测静语分解深神经网络(DNN)的性能的任务,这在自动驾驶等安全关键应用中非常重要。这里,许多高层决定依赖于这种DNN, 通常在离线操作中进行评价,而它们在在线操作中的性能仍然未知。为了解决这个问题,我们建议改进在线性能预测计划,以最近提出的预测主要静语分解任务性能的概念为基础。可以通过使用LIDAR传感器提供的测量和随后对语义分解性能的回归,评估单方深度估计的辅助任务,实现这一点。我们特别建议:(一) 在多任务培训设置中,对这两项任务进行顺序培训,(二) 将编码器和分解器的部分在两个任务网络之间共享,以提高效率,以及(三) 时间统计汇总方法,这大大降低了以小算法成本计算出的业绩预测错误。我们建议,对KITTI数据进行比较后,所有三个方面都改进了前一个预测。