In recent years, the research on empirical software engineering that uses qualitative data analysis (e.g., cases studies, interview surveys, and grounded theory studies) is increasing. However, most of this research does not deep into the reliability and validity of findings, specifically in the reliability of coding in which these methodologies rely on, despite there exist a variety of statistical techniques known as Inter-Coder Agreement (ICA) for analyzing consensus in team coding. This paper aims to establish a novel theoretical framework that enables a methodological approach for conducting this validity analysis. This framework is based on a set of coefficients for measuring the degree of agreement that different coders achieve when judging a common matter. We analyze different reliability coefficients and provide detailed examples of calculation, with special attention to Krippendorff's $\alpha$ coefficients. We systematically review several variants of Krippendorff's $\alpha$ reported in the literature and provide a novel common mathematical framework in which all of them are unified through a universal $\alpha$ coefficient. Finally, this paper provides a detailed guide of the use of this theoretical framework in a large case study on DevOps culture. We explain how $\alpha$ coefficients are computed and interpreted using a widely used software tool for qualitative analysis like Atlas.ti. We expect that this work will help empirical researchers, particularly in software engineering, to improve the quality and trustworthiness of their studies.
翻译:近年来,使用定性数据分析(例如案例研究、访谈调查和有根据的理论研究)的经验软件工程研究正在增加,但是,大多数研究没有深入了解调查结果的可靠性和有效性,特别是这些方法所依赖的编码的可靠性,尽管存在各种统计技术,称为《部门间协议》(ICA),用于分析团队编码中的共识。本文件旨在建立一个新的理论框架,使进行这种有效性分析的方法能够采用新的理论框架。这一框架以一套系数为基础,用以衡量不同编码员在判断一个共同事项时达成的协议程度。我们分析不同的可靠性系数,并提供详细的计算实例,特别注意Krippendorff的美元和法元的系数。我们系统地审查文献中报告的Krippendorff的美元(ICA)的几种变式,并提供一个新的共同数学框架,通过一个通用的 $/alpha 系数加以统一。最后,本文提供了在一项大型案例研究中使用这一理论框架的详尽指南,用于对DevOps文化的可靠性进行分析,并提供详细的计算实例,特别注意Krippendorfff's的美元系数。我们系统地研究如何用这种分析,例如,我们用Altalimalimstal orpormas。