There is a longstanding interest in capturing the error behaviour of object detectors by finding images where their performance is likely to be unsatisfactory. In real-world applications such as autonomous driving, it is also crucial to characterise potential failures beyond simple requirements of detection performance. For example, a missed detection of a pedestrian close to an ego vehicle will generally require closer inspection than a missed detection of a car in the distance. The problem of predicting such potential failures at test time has largely been overlooked in the literature and conventional approaches based on detection uncertainty fall short in that they are agnostic to such fine-grained characterisation of errors. In this work, we propose to reformulate the problem of finding "hard" images as a query-based hard image retrieval task, where queries are specific definitions of "hardness", and offer a simple and intuitive method that can solve this task for a large family of queries. Our method is entirely post-hoc, does not require ground-truth annotations, is independent of the choice of a detector, and relies on an efficient Monte Carlo estimation that uses a simple stochastic model in place of the ground-truth. We show experimentally that it can be applied successfully to a wide variety of queries for which it can reliably identify hard images for a given detector without any labelled data. We provide results on ranking and classification tasks using the widely used RetinaNet, Faster-RCNN, Mask-RCNN, and Cascade Mask-RCNN object detectors.
翻译:长久以来,人们一直希望通过寻找可能无法令人满意的图像来捕捉物体探测器的错误行为。 在诸如自主驾驶等现实世界应用中,除了简单的探测性能要求之外,还有必要描述潜在故障的特征。例如,对接近自利汽车行人失手的探测通常要求更仔细的检查,而不是对远处汽车失手的检测。在检测性不确定性的基础上预测这种潜在故障的问题在文献和常规方法中大都被忽视,因为检测性能的不确定性不够充分,因为它们对错误的精细区分性格特征缺乏知觉。在这项工作中,我们提议将寻找“硬性”图像的问题重新描述为基于查询的硬性图像检索任务,其中查询是“硬性性”的具体定义,并提供简单和不直观的方法,从而能够解决大型查询者的这一任务。我们的方法完全是事后的,并不需要地面测量性图解,而是独立于检测者,并且依赖于一个高效的蒙特卡洛估算,在地面-RC(Rechal)的深度查询中可以使用一个简单的随机模型,我们用一个实验性的数据来成功地检测。