In open source software (OSS) communities, existing leadership indicators are dominantly measured by code contribution or community influence. Recent studies on emergent leadership shed light on additional dimensions such as intellectual stimulation in collaborative communications. To that end, this paper proposes an automated approach, named iLead, to mine communication styles and identify emergent leadership behaviors in OSS communities, using issue comments data. We start with the construction of 6 categories of leadership behaviors based on existing leadership studies. Then, we manually label leadership behaviors in 10,000 issue comments from 10 OSS projects, and extract 304 heuristic linguistic patterns which represent different types of emergent leadership behaviors in flexible and concise manners. Next, an automated algorithm is developed to merge and consolidate different pattern sets extracted from multiple projects into a final pattern ranking list, which can be applied for the automatic leadership identification. The evaluation results show that iLead can achieve a median precision of 0.82 and recall of 0.78, outperforming ten machine/deep learning baselines. To demonstrate practical usefulness, we also conduct empirical analysis and human evaluation of the identified leadership behaviors from iLead. We argue that emergent leadership behaviors in issue discussion should be taken into consideration to broaden existing OSS leadership viewpoints. Practical insights on community building and leadership skill development are offered for OSS community and individual developers, respectively.
翻译:在开放源码软件(OSS)社区,现有领导力指标主要以代码贡献或社区影响来衡量。最近关于新兴领导力的研究揭示了其他层面,例如合作通信中的智力刺激。为此,本文件建议对通信风格采用名为iLead的自动化方法,并使用问题评论数据,确定开放源码软件社区的新兴领导力行为。我们从基于现有领导力研究的6类领导力行为开始,首先根据现有领导力研究,在10个OSS项目10个问题评论中将领导力行为分类为10个问题评论,并提取304种超常语言模式,这些模式以灵活和简洁的方式代表不同类型的新兴领导力行为。接下来,开发一种自动化算法,将从多个项目中提取的不同模式合并并整合为最终模式排名清单,用于自动确定领导力。评价结果显示,iLead可以达到0.82的中位精确度,回顾0.78的机器/深层学习基线。为了证明实际有用,我们还从iLead对已查明的领导力行为进行实证分析和人文评估。我们指出,在建设领导能力方面,应当分别对社区进行实见识判断,扩大社区进行个人认识。