With the advent of strong pre-trained natural language processing models such as BERT, DeBERTa, MiniLM, and T5, the amount of data that industries need in order to fine-tune these models to their niche use cases has drastically reduced (typically to a few hundred annotated samples for reasonable performance). However, even a few hundred annotated samples cannot always be guaranteed in low-resource domains such as automotive, which often limits the use of such deep learning models in an industrial setting. In this paper, we address the challenge of fine-tuning such pre-trained models with only a handful of annotated samples, also known as few-shot learning. Our experiments evaluate the performance of a diverse set of algorithms and methodologies on the task of classifying BOSCH automotive-domain textual software requirements into 3 categories, using only 15 annotated samples per category for fine-tuning. We find that while SciBERT- and DeBERTa-based models tend to be the most accurate with 15 training samples, their performance improves only marginally as the number of annotated samples is increased to 50, in contrast to Siamese- and T5-based models.
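As a concrete illustration of the kind of few-shot setup evaluated here (a minimal sketch, not the authors' exact pipeline), the snippet below trains a Siamese-style baseline: a frozen pre-trained sentence encoder embeds the 15 labelled requirements per class, and a lightweight linear classifier is fitted on top. The encoder name and the 3-way label set are illustrative assumptions only.

```python
# Minimal few-shot baseline sketch: frozen sentence embeddings + linear head.
# Encoder checkpoint and label names are hypothetical placeholders.
from sentence_transformers import SentenceTransformer
from sklearn.linear_model import LogisticRegression

LABELS = ["functional", "non_functional", "process"]  # assumed 3-way label set

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # any pre-trained sentence encoder


def train_few_shot_classifier(texts, labels):
    """Fit a linear classifier on frozen embeddings of ~15 samples per class."""
    embeddings = encoder.encode(texts, normalize_embeddings=True)
    clf = LogisticRegression(max_iter=1000)
    clf.fit(embeddings, labels)
    return clf


def predict(clf, texts):
    """Classify unseen requirement texts with the fitted linear head."""
    embeddings = encoder.encode(texts, normalize_embeddings=True)
    return clf.predict(embeddings)
```

Because only the small linear head is trained, this style of baseline is cheap to fit on a few dozen samples; fully fine-tuned SciBERT- or DeBERTa-based classifiers, as discussed above, can be more accurate but gain less from additional labelled data in this range.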