Forks play a central role in modern pull-based OSS development. Although rich empirical results on the participants, challenges, and features of forks have been announced, there is little discussion on quantitatively measuring the population of forks around OSS projects. In this paper, we take a step toward enriching the set of metrics about forks by proposing the fork entropy to measure the diversity of fork populations around OSS projects. We operationalize the proposed fork entropy based on Rao's quadratic entropy with a distance function defined on the forks' modifications to project files. After verifying the construct validity of fork entropy, we show the usefulness of fork entropy in understanding and predicting OSS development in terms of external productivity, the acceptance rate of external pull-requests, and code quality using a dataset consisting of fifty popular OSS projects hosted on GitHub. By conducting regression analyses, we find that fork entropy significantly and positively affects external productivity, the acceptance rate of external pull-requests, and code quality, even though sometimes with a small effect. However, as expected, fork entropy at a high level sometimes plays a negative role in OSS development. We also observe fork entropy can magically moderate other factors' effect on some project outcomes. We believe our new metric of fork entropy is helpful to guide practices of OSS development.
翻译:福克斯在现代基于拉动的开放源码软件发展中发挥着核心作用。 尽管已经宣布了参与者、挑战和叉子特征方面的丰富经验结果,但很少讨论对开放源码软件项目周围叉子人口数量进行定量测量的问题。 在本文中,我们迈出了一步,通过提出叉叉子酶,以测量开放源码软件项目周围叉子人口的多样性,丰富关于叉子的一套衡量标准。我们实施了基于Rao's 的四方形对项目文档修改功能界定的距离功能的拟议叉子酶。在核查叉子昆虫的构造有效性之后,我们展示了在理解和预测开放源码软件的外部生产率、外部拉动请求的接受率和代码质量方面叉子的实用性。我们通过进行回归分析,我们发现叉子对外部生产力、外部拉动要求的接受率和代码质量有着显著和积极的影响,尽管有时作用很小,但我们展示了在理解和预测的磁性模型开发结果方面使用叉子的叉子,但是,我们有时在高端观察了我们正方形的磁性模型。