简便、高效、少见的语文模式学习 (PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models)

Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as 32 data points. PERFECT makes two key design choices: First, we show that manually engineered task prompts can be replaced with task-specific adapters that enable sample-efficient fine-tuning and reduce memory and storage costs by roughly factors of 5 and 100, respectively. Second, instead of using handcrafted verbalizers, we learn new multi-token label embeddings during fine-tuning, which are not tied to the model vocabulary and which allow us to avoid complex auto-regressive decoding. These embeddings are not only learnable from limited data but also enable nearly 100x faster training and inference. Experiments on a wide range of few-shot NLP tasks demonstrate that PERFECT, while being simple and efficient, also outperforms existing state-of-the-art few-shot learning methods. Our code is publicly available at https://github.com/facebookresearch/perfect.git.

翻译：目前对事先经过训练的蒙面语言模型(PLM)进行微小微调的方法需要为每项新任务精心设计的提示和言语,以便将示例转换成Cluze-format(PLM能分得分)。在这项工作中,我们建议PERFECT(PERFECT),这是在不依赖任何这种手工艺的情况下对PLMS进行微调的简单而有效的方法,这种方法非常有效,因为只有32个数据点。PerfECT有两个关键的设计选择:首先,我们表明手工设计的任务提示可以由特定任务调整器取代,使样本高效的微调和减少记忆和存储成本,分别以5和100个系数计算。第二,我们不使用手制作的口语调器,而是在微调期间学习新的多端标签嵌入,而无需依赖任何这种手工艺,这使我们能够避免复杂的自动反向解析。这些嵌入不仅可以从有限的数据中学习,而且能够使近100x更快的培训和推断。在少数的NLPP-P任务范围进行实验,同时进行微的实验,我们现有的快速的版本/正制方法也是简单的。

相关内容

小样本学习

关注 215

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

20篇「ACL2020」最新论文抢先看！看自然语言处理2020在研究什么？

专知会员服务

97+阅读 · 2020年4月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日