Software Vulnerability (SV) severity assessment is a vital task for informing SV remediation and triage. Ranking of SV severity scores is often used to advise prioritization of patching efforts. However, severity assessment is a difficult and subjective manual task that relies on expertise, knowledge, and standardized reporting schemes. Consequently, different data sources that perform independent analysis may provide conflicting severity rankings. Inconsistency across these data sources affects the reliability of severity assessment data, and can consequently impact SV prioritization and fixing. In this study, we investigate severity ranking inconsistencies over the SV reporting lifecycle. Our analysis helps characterize the nature of this problem, identify correlated factors, and determine the impacts of inconsistency on downstream tasks. Our findings observe that SV severity often lacks consideration or is underestimated during initial reporting, and such SVs consequently receive lower prioritization. We identify six potential attributes that are correlated to this misjudgment, and show that inconsistency in severity reporting schemes can severely degrade the performance of downstream severity prediction by up to 77%. Our findings help raise awareness of SV severity data inconsistencies and draw attention to this data quality problem. These insights can help developers better consider SV severity data sources, and improve the reliability of consequent SV prioritization. Furthermore, we encourage researchers to provide more attention to SV severity data selection.
翻译:软件脆弱性(SV) 严重程度评估是向SV提供补救和分级信息的关键任务。 SV重度分数的分级常常被用来为确定补丁工作的优先顺序提供建议。但是,严重程度评估是一项困难和主观的人工任务,依赖专门知识、知识和标准化的报告制度。因此,进行独立分析的不同数据来源可能提供相互冲突的重分等级。这些数据来源的不一致会影响严格度评估数据的可靠性,并因此可能影响SV的优先顺序和确定。在本研究中,我们调查SV报告生命周期的分级不一致程度。我们的分析有助于确定这一问题的性质,查明相关因素,并确定不一致对下游任务的影响。我们的调查结果显示,SV重度评估的重度往往得不到考虑,或者被低估,因此,这种SV的分级排序会降低。我们确定了与这种偏差相关的六个潜在属性,表明,在严格性报告制度方面的不一致会严重地降低下游重度预测的性能,最高达77%。我们的调查结果有助于提高对SV严重程度数据不一致性的认识,并引起对数据质量问题的注意。这些洞见有助于发展者更好地考虑SV严重程度数据选择来源。