采取一克神经特征, 获得增强的集团强力 (Take One Gram of Neural Features, Get Enhanced Group Robustness)

Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts. The presence of spurious correlations in training datasets leads ERM-trained models to display high loss when evaluated on minority groups not presenting such correlations. Extensive attempts have been made to develop methods improving worst-group robustness. However, they require group information for each training input or at least, a validation set with group labels to tune their hyperparameters, which may be expensive to get or unknown a priori. In this paper, we address the challenge of improving group robustness without group annotation during training or validation. To this end, we propose to partition the training dataset into groups based on Gram matrices of features extracted by an ``identification'' model and to apply robust optimization based on these pseudo-groups. In the realistic context where no group labels are available, our experiments show that our approach not only improves group robustness over ERM but also outperforms all recent baselines

翻译：培训数据集中存在虚假的关联,导致机构风险管理培训模式在对没有显示这种关联的少数群体进行评估时显示大量亏损。已经作出广泛努力,制定方法改进最差群体的稳健性。然而,它们要求为每项培训投入提供群体信息,或至少为每个培训投入提供一组信息,或至少提供一组标签的验证组标签,以调整其超参数,这些参数可能昂贵,或者事先可能不知道。在本文中,我们讨论了在培训或验证过程中无需集体批注而提高集体稳健性的挑战。为此,我们提议将培训数据集分成基于“身份识别”模型所提取特征的格拉姆矩阵的小组,并根据这些伪组采用稳健的优化。在没有分组标签的现实背景下,我们的实验表明,我们的方法不仅改进了集团对机构风险管理的稳健性,而且超越了最近的所有基线。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日