Open-source Software (OSS) has become a valuable resource in both industry and academia over the last few decades. Despite the innovative structures they develop to support the projects, OSS projects and their communities have complex needs and face risks such as getting abandoned. To manage the internal social dynamics and community evolution, OSS developer communities have started relying on written governance documents that assign roles and responsibilities to different community actors. To facilitate the study of the impact and effectiveness of formal governance documents on OSS projects and communities, we present a longitudinal dataset of 710 GitHub-hosted OSS projects with \path{GOVERNANCE.MD} governance files. This dataset includes all commits made to the repository, all issues and comments created on GitHub, and all revisions made to the governance file. We hope its availability will foster more research interest in studying how OSS communities govern their projects and the impact of governance files on communities.
翻译:开源软件(OSS)在过去几十年中已成为工业和学术界的宝贵资源。尽管他们开发了创新的结构来支持其项目,但OSS项目及其社区具有复杂的需求并面临被废弃的风险。为了管理内部社会动态和社区演变,OSS开发者社区开始依赖于书面治理文件,将不同的社区参与者分配给不同的角色和责任。为了促进研究正式治理文件对OSS项目和社区的影响和有效性,我们提供了一个纵向数据集,其中包含710个在GitHub上托管的OSS项目,其中包含“GITHUB-MD”治理文件。该数据集包括对存储库进行的所有提交,GitHub上创建的所有问题和评论以及对治理文件进行的所有修订。我们希望它的可用性将促进更多关于研究OSS社区如何治理其项目以及治理文件对社区的影响的研究兴趣。