In software development, due to the lack of knowledge or information, time pressure, complex context, and many other factors, various uncertainties emerge during the development process, leading to assumptions scattered in projects. Being unaware of certain assumptions can result in critical problems (e.g., system vulnerability and failures). The prerequisite of analyzing and understanding assumptions in software development is to identify and extract those assumptions with acceptable effort. In this paper, we proposed a tool (i.e., SCAMiner) to automatically identify and extract self-claimed assumptions (SCAs) on GitHub projects. To evaluate the applicability of SCAMiner, we first presented an example of using the tool to mine SCAs from one large and popular deep learning framework project: the TensorFlow project on GitHub. We then conducted an evaluation of the tool. The results show that SCAMiner can effectively identify and extract SCAs from the repositories on GitHub.
翻译:在软件开发方面,由于缺乏知识或信息、时间压力、复杂背景和许多其他因素,在开发过程中出现了各种不确定因素,导致项目中的假设分散。不了解某些假设可能会造成关键问题(例如系统脆弱性和故障)。在软件开发方面分析和理解假设的先决条件是,以可接受的努力来查明和提取这些假设。在本文件中,我们提出了一个工具(即SCAMiner),用以自动识别和提取GitHub项目的自封假设(SCAS)。为了评估SCAMiner的适用性,我们首先举了一个实例,从一个大型和广受欢迎的深层学习框架项目(即GitHub的TensorFlow项目)中用该工具来开采SCA。我们随后对工具进行了评估。结果显示SCAMiner能够有效地识别和从GitHub的储存库中提取SCA。</s>