We introduce several new datasets namely ImageNet-A/O and ImageNet-R as well as a synthetic environment and testing suite we called CAOS. ImageNet-A/O allow researchers to focus in on the blind spots remaining in ImageNet. ImageNet-R was specifically created with the intention of tracking robust representation as the representations are no longer simply natural but include artistic, and other renditions. The CAOS suite is built off of CARLA simulator which allows for the inclusion of anomalous objects and can create reproducible synthetic environment and scenes for testing robustness. All of the datasets were created for testing robustness and measuring progress in robustness. The datasets have been used in various other works to measure their own progress in robustness and allowing for tangential progress that does not focus exclusively on natural accuracy. Given these datasets, we created several novel methods that aim to advance robustness research. We build off of simple baselines in the form of Maximum Logit, and Typicality Score as well as create a novel data augmentation method in the form of DeepAugment that improves on the aforementioned benchmarks. Maximum Logit considers the logit values instead of the values after the softmax operation, while a small change produces noticeable improvements. The Typicality Score compares the output distribution to a posterior distribution over classes. We show that this improves performance over the baseline in all but the segmentation task. Speculating that perhaps at the pixel level the semantic information of a pixel is less meaningful than that of class level information. Finally the new augmentation technique of DeepAugment utilizes neural networks to create augmentations on images that are radically different than the traditional geometric and camera based transformations used previously.
翻译:我们引入了几套新的数据集,即图像Net-A/O和图像网络-R,以及一个合成环境和测试套件,我们称之为CAOS。图像Net-A/O允许研究人员关注图像Net中残留的盲点。图像Net-R是专门为跟踪稳健的表达方式而创建的,因为其表达方式不再单纯是自然的,而是包括艺术和其他移位。CAOS套件建于CARLA模拟器之外,它允许包含异常对象,并且可以创造可复制的传统合成环境与图像以测试稳健性。所有数据集都是为测试稳健性和测量进展而创建的。所有数据集都被用于测试稳健性和测量稳健性。其他各种工作都使用了数据集来衡量自身在稳健性上的进展,并允许不完全以自然准确性为焦点的进展。鉴于这些数据集,我们创建了若干新颖的方法,目的是推进稳健性研究。我们从以最高级测点和典型度评分制的形式构建了简单的基线,并创建了一种新型的数据增强方法,从而改进了上述基准的深度缩缩缩缩缩缩缩缩缩缩缩缩图。我们使用了Slial的递图,而最后的递平级的递平平平平平平平比平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平平