Context: Requirements engineering researchers have been experimenting with machine learning and deep learning approaches for a range of RE tasks, such as requirements classification, requirements tracing, ambiguity detection, and modelling. However, most of today's ML/DL approaches are based on supervised learning techniques, meaning that they need to be trained using a large amount of task-specific labelled training data. This constraint poses an enormous challenge to RE researchers, as the lack of labelled data makes it difficult for them to fully exploit the benefit of advanced ML/DL technologies. Objective: This paper addresses this problem by showing how a zero-shot learning approach can be used for requirements classification without using any labelled training data. We focus on the classification task because many RE tasks can be framed as classification problems. Method: The ZSL approach used in our study employs contextual word-embeddings and transformer-based language models. We demonstrate this approach through a series of experiments to perform three classification tasks: (1)FR/NFR: classification functional requirements vs non-functional requirements; (2)NFR: identification of NFR classes; (3)Security: classification of security vs non-security requirements. Results: The study shows that the ZSL approach achieves an F1 score of 0.66 for the FR/NFR task. For the NFR task, the approach yields F1~0.72-0.80, considering the most frequent classes. For the Security task, F1~0.66. All of the aforementioned F1 scores are achieved with zero-training efforts. Conclusion: This study demonstrates the potential of ZSL for requirements classification. An important implication is that it is possible to have very little or no training data to perform classification tasks. The proposed approach thus contributes to the solution of the long-standing problem of data shortage in RE.
翻译:要求:工程研究人员一直在对一系列RE任务,如要求分类、要求追踪、模糊性检测和建模等进行机器学习和深层次学习方法的实验。然而,今天的ML/DL方法大多以监督的学习技术为基础,这意味着他们需要用大量特定任务标记的培训数据接受培训。这种限制对RE研究人员构成巨大挑战,因为缺乏贴标签数据使他们难以充分利用先进的ML/DL技术的好处。 目标:本文件通过展示如何在不使用任何标签的培训数据的情况下,在需求分类中使用零点学习方法。我们注重分类工作,因为许多RE方法可以被描述为分类问题。方法:我们的研究中使用的ZSL方法需要使用大量针对特定任务的培训数据组合和变压器语言模型。我们通过一系列实验来展示这一方法来完成三项分类任务:(1) FR/NFR:分类功能要求与非功能要求;(2) NFR:确定可能的NFR类别;(3)安全分类:安全要求与非安全要求的分类,而不是标注的培训数据;1-NRSL 成果:因此,在FSL1-R任务中采用长期的排序中,研究方法显示FSLI-x。</s>