Context: Because code reviews (CRs) demand significant effort, even a minor improvement in their effectiveness can yield significant savings for a software development organization. Aim: This study aims to develop a finer-grained understanding of what makes a code review comment useful to OSS developers, to what extent developers consider a code review comment useful, and how various contextual and participant-related factors influence its usefulness. Method: To this end, we conducted a three-stage mixed-method study. We randomly selected 2,500 CR comments from the OpenDev Nova project and manually categorized them. We designed a survey of OpenDev developers to better understand their perspectives on useful CRs. Combining our survey-obtained scores with our manually labeled dataset, we trained two regression models: one to identify factors that influence the usefulness of CR comments, and the other to identify factors that improve the odds of identifying 'Functional' defects over other defect types. Key findings: The results of our study suggest that a CR comment's usefulness is dictated not only by its technical contributions, such as defect findings or quality improvement tips, but also by its linguistic characteristics, such as comprehensibility and politeness. While a reviewer's coding experience is positively associated with CR usefulness, the number of mutual reviews, the comment volume in a file, the total number of lines added/modified, and the CR interval have the opposite association. Although authorship and reviewership experience for the files under review have been the most popular attributes in reviewer recommendation systems, we do not find any significant association between those attributes and CR usefulness.