For natural language processing 'text-to-text' tasks, the prevailing approaches rely heavily on pretraining large self-supervised models on massive external data sources, which imposes exceptional pretraining data requirements and limits the ability to pretrain over small datasets. However, fundamental pretraining capabilities such as few- and zero-shot learning and preserving minority concept (long-tail) prediction performance remain open challenges, as do evaluation scenarios designed to measure them. We thus propose Contrastive Label-Embedding Self-Supervision (CLESS) pretraining, which enables pretraining from 'task-internal' data that is orders of magnitude smaller, while still strongly improving fully supervised, long-tail, few-shot and self-supervised zero-shot learning abilities. We analyse improvements in learning dynamics over baselines on a challenging long-tailed, low-resource, multi-label text classification scenario with noisy, highly sparse labels and many minority concepts. We find that long-tailed zero- and few-shot learning markedly benefit from increasing 'dataset-internal' self-supervised pretraining signals, which helps reduce the reliance on large external sources.
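To make the contrastive label-embedding idea concrete, the following is a minimal sketch, under our own assumptions, of a text-vs-label-embedding matching objective with sampled negative labels. The encoder choice (a bag-of-words `EmbeddingBag`), the uniform negative sampling, and the binary cross-entropy loss are illustrative simplifications, not the authors' exact CLESS implementation.

```python
# Minimal sketch (assumption, not the paper's code) of contrastive
# text-to-label-embedding pretraining: a text encoding is scored against
# positive label embeddings and sampled negative label embeddings with a
# binary cross-entropy (noise-contrastive) loss.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ContrastiveLabelMatcher(nn.Module):
    def __init__(self, vocab_size: int, num_labels: int, dim: int = 128):
        super().__init__()
        self.text_emb = nn.EmbeddingBag(vocab_size, dim)  # simple bag-of-words text encoder
        self.label_emb = nn.Embedding(num_labels, dim)    # one embedding per (pseudo-)label

    def forward(self, token_ids, label_ids):
        # token_ids: (batch, seq_len); label_ids: (batch, n_labels)
        text = self.text_emb(token_ids)                   # (batch, dim)
        labels = self.label_emb(label_ids)                # (batch, n_labels, dim)
        return torch.einsum("bd,bnd->bn", text, labels)   # matching scores per label

def contrastive_step(model, token_ids, pos_labels, num_labels, n_neg=5):
    """One training step: positives are real (or dataset-internal pseudo) labels,
    negatives are uniformly sampled label ids."""
    neg_labels = torch.randint(0, num_labels, (token_ids.size(0), n_neg))
    label_ids = torch.cat([pos_labels, neg_labels], dim=1)
    targets = torch.cat([torch.ones_like(pos_labels, dtype=torch.float),
                         torch.zeros_like(neg_labels, dtype=torch.float)], dim=1)
    scores = model(token_ids, label_ids)
    return F.binary_cross_entropy_with_logits(scores, targets)

# Usage sketch with toy shapes.
model = ContrastiveLabelMatcher(vocab_size=10_000, num_labels=500)
tokens = torch.randint(0, 10_000, (4, 32))   # 4 texts, 32 tokens each
positives = torch.randint(0, 500, (4, 2))    # 2 positive labels per text
loss = contrastive_step(model, tokens, positives, num_labels=500)
loss.backward()
```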