Although state-of-the-art object detection methods have shown compelling performance, models often are not robust to adversarial attacks and out-of-distribution data. We introduce a new dataset, Natural Adversarial Objects (NAO), to evaluate the robustness of object detection models. NAO contains 7,934 images and 9,943 objects that are unmodified and representative of real-world scenarios, but cause state-of-the-art detection models to misclassify with high confidence. The mean average precision (mAP) of EfficientDet-D7 drops 74.5% when evaluated on NAO compared to the standard MSCOCO validation set. Moreover, by comparing a variety of object detection architectures, we find that better performance on MSCOCO validation set does not necessarily translate to better performance on NAO, suggesting that robustness cannot be simply achieved by training a more accurate model. We further investigate why examples in NAO are difficult to detect and classify. Experiments of shuffling image patches reveal that models are overly sensitive to local texture. Additionally, using integrated gradients and background replacement, we find that the detection model is reliant on pixel information within the bounding box, and insensitive to the background context when predicting class labels. NAO can be downloaded at https://drive.google.com/drive/folders/15P8sOWoJku6SSEiHLEts86ORfytGezi8.
翻译:虽然最先进的物体探测方法显示令人信服的性能,但模型往往对对抗性攻击和分配外数据不可靠。我们引入了一个新的数据集,即自然反向天体(NAO),以评价物体探测模型的可靠性。NAO包含7,934图像和9,943天体,这些图像和9,993天体没有经过修改,并代表了现实世界的情景,但导致最先进的探测模型以高度自信对状态检测模型进行错误分类。高效 Det-D7的平均平均精确度(mAP)在对NAO的评价中,与标准 MCCO 校验数据集相比,下降了74.5 %。此外,通过比较各种天体探测结构,我们发现MSCCO 校验成套的更好性能不一定转化为对NAO的更好性能,这表明光力不能简单地通过培训一个更准确的模型来实现。我们进一步调查为什么NAO中的例子很难被检测和分类。6 调控图像的实验表明,模型对当地质素养过度敏感。此外,使用综合的梯度和背景替换,我们发现,在SEARGGG/CRIFF的底,可以对O的定位进行背景进行定位的定位。