Object-oriented software metrics provide a numerical characterization of software quality. They have also been used in the assessment and identification of technical debt. However, metrics generally need to be used with thresholds as reference points that help to interpret their values properly and objectively. The problem is that, while there are many proposed metrics, there are relatively few studies on thresholds and threshold calculation methods; hence, the effective application of metrics in practice has been limited. Moreover, although it has been acknowledged that thresholds should not be absolute, but rather relative to certain contextual factors, the context is still not considered in most threshold studies. In this paper, the relationship between system size (as a contextual factor) and metric thresholds is investigated. The objective is to build predictive models that estimate thresholds based solely on system size, and to assess the feasibility of this approach as a threshold estimation method. An empirical study is conducted for this purpose using 36 defect-prediction datasets and six metrics. The results show that the proposed threshold estimation method is feasible, and it can achieve an accuracy remarkably comparable to more complex threshold models.
翻译:面向目标的软件衡量标准提供了软件质量的定量特征,在评估和确定技术债务时也使用了这些指标。但是,指标通常需要以阈值作为参考点来使用,以帮助正确和客观地解释其价值。问题是,虽然有许多拟议的衡量标准,但关于阈值和阈值计算方法的研究相对较少;因此,在实践中有效应用衡量标准是有限的。此外,虽然人们承认阈值不应是绝对的,而是相对于某些背景因素而言,但大多数临界值研究仍未考虑到这一背景。本文调查了系统规模(作为背景因素)与衡量阈值之间的关系。目标是建立预测模型,仅根据系统规模估计阈值,并评估这一方法作为阈值估计方法的可行性。为此目的,利用36个缺陷数据集和6个计量标准进行了实证研究。结果显示,拟议的阈值估计方法是可行的,其准确性可以与更复杂的阈值模型相当。