Calibration and uncertainty estimation are crucial topics in high-risk environments. We introduce a new diversity regularizer for classification tasks that uses out-of-distribution samples and increases the overall accuracy, calibration and out-of-distribution detection capabilities of ensembles. Following the recent interest in the diversity of ensembles, we systematically evaluate the viability of explicitly regularizing ensemble diversity to improve calibration on in-distribution data as well as under dataset shift. We demonstrate that diversity regularization is highly beneficial in architectures, where weights are partially shared between the individual members and even allows to use fewer ensemble members to reach the same level of robustness. Experiments on CIFAR-10, CIFAR-100, and SVHN show that regularizing diversity can have a significant impact on calibration and robustness, as well as out-of-distribution detection.
翻译:在高风险环境中,校准和不确定性估算是关键议题。我们为分类任务引入了新的多样性常规化机制,使用分配以外的样本,提高编组的总体精确度、校准度和分配以外的检测能力。由于最近对组合多样性的兴趣,我们系统地评估明确规范组合多样性的可行性,以改进分配数据的校准以及数据集转换。我们表明,在结构中,多样性规范化非常有益,因为各个成员部分地分担了加权,甚至允许使用较少的合用成员达到同样的稳健程度。关于CIFAR-10、CIFAR-100和SVHN的实验表明,标准化多样性可能对校准和稳健以及分配之外的检测产生重大影响。