Object-oriented software metrics provide a numerical characterization of software quality. They have also been used in the assessment and identification of technical debt. However, metrics generally need to be used with thresholds as reference points that help to interpret their values properly and objectively. The problem is that, while there are many proposed metrics, there are relatively few studies on thresholds and threshold calculation methods; hence, the effective application of metrics in practice has been limited. Moreover, although it has been acknowledged that thresholds should not be absolute, but rather relative to certain contextual factors, the context is still not considered in most threshold studies. In~this~paper, the relationship between system size (as a contextual factor) and metric thresholds is investigated. The objective is to build predictive models that estimate thresholds based solely on system size, and to assess the feasibility of this approach as a threshold estimation method. An empirical study is conducted for this purpose using $36$ defect-prediction datasets and six metrics. The results show that the proposed threshold estimation method is feasible, and it can achieve an accuracy remarkably comparable to more complex threshold models. Moreover, the proposed method has the advantage of being readily usable even for systems that lack historical quality-related data.
翻译:以目标为导向的软件衡量标准提供了软件质量的定量特征,在评估和确定技术债务时也使用了这些指标。但是,指标通常需要以阈值作为参考点来使用,以帮助正确和客观地解释其价值。问题在于,虽然有许多拟议的衡量标准,但关于阈值和阈值计算方法的研究相对较少;因此,在实践中有效应用衡量标准是有限的。此外,尽管人们承认阈值不应是绝对的,而是相对于某些背景因素而言,但大多数临界值研究仍未考虑到这一背景。在本~文件中,系统规模(作为背景因素)与衡量阈值之间的关系得到了调查。目标是建立预测模型,仅根据系统规模估计阈值,并评估这一方法作为阈值估计方法的可行性。为此目的,利用3 600美元的缺陷定位数据集和6个衡量标准进行了实证研究。结果显示,拟议的阈值估计方法是可行的,而且可以达到与更复杂的阈值模型相当的精确度。此外,拟议方法的优点是,即使缺乏历史质量数据的系统也很容易使用。