The current deep learning approaches for low-dose CT denoising can be divided into paired and unpaired methods. The former involves the use of well-paired datasets, whilst the latter relaxes this constraint. The large availability of unpaired datasets has raised the interest in deepening unpaired denoising strategies that, in turn, need for robust evaluation techniques going beyond the qualitative evaluation. To this end, we can use quantitative image quality assessment scores that we divided into two categories, i.e., paired and unpaired measures. However, the interpretation of unpaired metrics is not straightforward, also because the consistency with paired metrics has not been fully investigated. To cope with this limitation, in this work we consider 15 paired and unpaired scores, which we applied to assess the performance of low-dose CT denoising. We perform an in-depth statistical analysis that not only studies the correlation between paired and unpaired metrics but also within each category. This brings out useful guidelines that can help researchers and practitioners select the right measure for their applications.
翻译:当前的低剂量 CT 去噪深度学习方法可以分为成对和非成对两种。前者涉及使用配对良好的数据集,而后者则放宽了这种限制。非成对数据集的大量可用性引发了对加深非成对去噪策略的兴趣,进而需要强大的评估技术,超越定性评估。为此,我们可以使用量化图像质量评估分数,将其分为成对和非成对两类度量。然而,非成对指标的解释并不直观,这也部分因为尚未完全研究与成对指标的一致性。为解决这个限制,本研究考虑了 15 个成对和非成对分数,用于评估低剂量 CT 去噪的性能。我们进行了深入的统计分析,不仅研究了成对和非成对度量之间的相关性,而且还研究了每个类别内的相关性。这提供了有用的指导方针,可以帮助研究人员和从业者选择适合其应用程序的正确度量。