Uncertainty estimates help to identify ambiguous, novel, or anomalous inputs, but the reliable quantification of uncertainty has proven to be challenging for modern deep networks. In order to improve uncertainty estimation, we propose On-Manifold Adversarial Data Augmentation or OMADA, which specifically attempts to generate the most challenging examples by following an on-manifold adversarial attack path in the latent space of an autoencoder-based generative model that closely approximates decision boundaries between two or more classes. On a variety of datasets as well as on multiple diverse network architectures, OMADA consistently yields more accurate and better calibrated classifiers than baseline models, and outperforms competing approaches such as Mixup, as well as achieving similar performance to (at times better than) post-processing calibration methods such as temperature scaling. Variants of OMADA can employ different sampling schemes for ambiguous on-manifold examples based on the entropy of their estimated soft labels, which exhibit specific strengths for generalization, calibration of predicted uncertainty, or detection of out-of-distribution inputs.
翻译:不确定性估计有助于确定模糊、新颖或异常的投入,但可靠地量化不确定性对现代深层网络来说具有挑战性。为了改进不确定性估计,我们提议采用在线反对数据增强或OMADA, 具体尝试在基于自动编码器的基因化模型的潜在空间内采用在线对抗性攻击路径来产生最具挑战性的例子,该模型密切接近两个或两个以上类别之间的决定界限。关于各种数据集以及多种不同的网络结构,OMAADA一直产生比基线模型更准确和更好的校准分类器,比混合等相互竞争的方法要好,并取得类似(比温度缩放率等)处理后校准方法的类似性能。OMADA的变式可以采用不同的抽样办法,根据估计软标签的酶,对模糊的上层实例进行模糊性抽样,这在概括、校准预测的不确定性或检测分配外输入方面有具体优势。