High-capacity deep neural networks (DNNs) trained with Empirical Risk Minimization (ERM) often suffer from poor worst-group accuracy despite good on-average performance, where worst-group accuracy measures a model's robustness towards certain subpopulations of the input space. Spurious correlations and memorization behaviors of ERM trained DNNs are typically attributed to this degradation in performance. We develop a method, called CRIS, that address these issues by performing robust classifier retraining on independent splits of the dataset. This results in a simple method that improves upon state-of-the-art methods, such as Group DRO, on standard datasets while relying on much fewer group labels and little additional hyperparameter tuning.
翻译:高能深神经网络(DNNs)经过经验风险最小化(ERM)培训,其精度往往最差,尽管平均表现良好,但质量最差,最差的精确度衡量模型对输入空间某些亚群的坚固度,机构风险管理培训的DNS的纯相关性和记忆性行为通常归因于这种性能退化。我们开发了一种方法,称为CRIS,通过对数据集的独立分割进行强有力的分类再培训来解决这些问题。这导致一种简单的方法,在使用标准数据集的同时,在使用最先进的方法(如DRO小组)的同时,在使用更小的群类标签和少量的超立度调整方面,改进了标准数据集。