A meaningful and deep understanding of the human aspects of software engineering (SE) requires psychological constructs to be considered. Psychology theory can facilitate the systematic and sound development as well as the adoption of instruments (e.g., psychological tests, questionnaires) to assess these constructs. In particular, to ensure high quality, the psychometric properties of instruments need evaluation. In this paper, we provide an introduction to psychometric theory for the evaluation of measurement instruments for SE researchers. We present guidelines that enable using existing instruments and developing new ones adequately. We conducted a comprehensive review of the psychology literature framed by the Standards for Educational and Psychological Testing. We detail activities used when operationalizing new psychological constructs, such as item pooling, item review, pilot testing, item analysis, factor analysis, statistical property of items, reliability, validity, and fairness in testing and test bias. We provide an openly available example of a psychometric evaluation based on our guideline. We hope to encourage a culture change in SE research towards the adoption of established methods from psychology. To improve the quality of behavioral research in SE, studies focusing on introducing, validating, and then using psychometric instruments need to be more common.
翻译:对软件工程(SE)的人类方面进行有意义和深入的了解需要考虑心理结构。心理学理论可以促进系统和健全的发展,以及采用评估这些结构的工具(例如心理测试、问卷调查),特别是为了确保文书的心理特征具有高质量,需要对这些特征进行评估。在本文件中,我们介绍了用于评价SE研究人员测量工具的心理计量理论。我们提出了能够利用现有工具并充分开发新工具的指南。我们全面审查了教育和心理测试标准所构建的心理学文献。我们详细介绍了在操作新的心理结构时使用的活动,例如项目集合、项目审查、实验性测试、项目分析、要素分析、物品的统计属性、可靠性、有效性以及测试和测试偏向性。我们提供了一个公开的基于我们的准则进行心理计量评估的范例。我们希望鼓励SE进行文化研究,以便采用既有的心理学方法。我们想提高SE的行为研究的质量,研究的重点是引入、验证,然后使用心理测量工具。我们需要更加普遍。