[Background] In large open-source software projects, development knowledge is often fragmented across multiple artefacts and contributors such that individual stakeholders are generally unaware of the full breadth of the product features. However, users want to know what the software is capable of, while contributors need to know where to fix, update, and add features. [Objective] This work aims at understanding how feature knowledge is documented in GitHub projects and how it is linked (if at all) to the source code. [Method] We conducted an in-depth qualitative exploratory content analysis of 25 popular GitHub repositories that provided the documentation artefacts recommended by GitHub's Community Standards indicator. We first extracted strategies used to document software features in textual artefacts and then strategies used to link the feature documentation with source code. [Results] We observed feature documentation in all studied projects in artefacts such as READMEs, wikis, and website resource files. However, the features were often described in an unstructured way. Additionally, tracing techniques to connect feature documentation and source code were rarely used. [Conclusions] Our results suggest a lacking (or a low-prioritised) feature documentation in open-source projects, little use of normalised structures, and a rare explicit referencing to source code. As a result, product feature traceability is likely to be very limited, and maintainability to suffer over time.
翻译:在大型开放源码软件项目中,发展知识往往分散于多种手工艺品和贡献者,因此,个人利益攸关方一般不知道产品特征的全方位,但用户想知道软件能够做什么,而贡献者则需要知道如何修复、更新和添加特征。[目标]这项工作的目的是了解GitHub项目中如何记录特征知识,以及如何(如果有的话)将其与源代码联系起来。[方法]我们对25个受欢迎的GitHub储存库进行了深入的定性探索内容分析,这些储存库提供了GitHub社区标准指标建议的文件工艺品。我们首先提取了用于将软件特征特征与源码联系起来的战略,而随后又采用了将特征文件与源码连接起来的战略。[Results]我们观察了GitHub项目中所有研究的特征知识如何记录,以及如何(如果有的话)与源码联系起来。然而,这些特征往往以不结构化的方式描述。此外,很少使用连接特征文档和源码的追踪技术。[封闭]我们的结果表明,通常缺乏一种开放的、或原始的源码。