Few-shot NLP research is highly active, yet conducted in disjoint research threads with evaluation suites that lack challenging-yet-realistic testing setups and fail to employ careful experimental design. Consequently, the community does not know which techniques perform best, or even whether they outperform simple baselines. In response, we formulate the FLEX Principles, a set of requirements and best practices for unified, rigorous, valid, and cost-sensitive few-shot NLP evaluation. These principles include Sample Size Design, a novel approach to benchmark design that optimizes statistical accuracy and precision while keeping evaluation costs manageable. Following the principles, we release the FLEX benchmark, which includes four few-shot transfer settings, zero-shot evaluation, and a public leaderboard that covers diverse NLP tasks. In addition, we present UniFew, a prompt-based model for few-shot learning that unifies pretraining and finetuning prompt formats, eschewing the complex machinery recent prompt-based approaches use to adapt downstream task formats to language model pretraining objectives. We demonstrate that despite its simplicity, UniFew achieves results competitive with both popular meta-learning and prompt-based approaches.
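The tradeoff behind Sample Size Design can be made concrete with a back-of-the-envelope calculation: treating an accuracy estimate as a binomial proportion, the confidence-interval half-width shrinks as 1/sqrt(n), so a precision target translates directly into evaluation cost. The sketch below is a minimal illustration of that idea only, assuming a normal-approximation confidence interval; the function names and the simple search are ours, not the benchmark's actual design procedure.

```python
import math

def ci_half_width(n, p=0.5, z=1.96):
    """Approximate 95% CI half-width for an accuracy estimate
    treated as a binomial proportion over n test examples.
    p=0.5 is the worst case (widest interval)."""
    return z * math.sqrt(p * (1 - p) / n)

def smallest_n_within_budget(target_half_width, max_budget):
    """Smallest per-task sample size whose CI half-width meets the
    target, capped by an evaluation-cost budget (in examples)."""
    n = 1
    while ci_half_width(n) > target_half_width and n < max_budget:
        n += 1
    return n

if __name__ == "__main__":
    # A +/-5-point CI needs ~385 examples; halving the interval
    # to +/-2.5 points roughly quadruples the cost (~1537 examples).
    for hw in (0.05, 0.025):
        print(hw, smallest_n_within_budget(hw, max_budget=5000))
```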