Super-resolution plays an essential role in medical imaging because it provides an alternative way to achieve high spatial resolutions and image quality with no extra acquisition costs. In the past few decades, the rapid development of deep neural networks has promoted super-resolution performance with novel network architectures, loss functions and evaluation metrics. Specifically, vision transformers dominate a broad range of computer vision tasks, but challenges still exist when applying them to low-level medical image processing tasks. This paper proposes an efficient vision transformer with residual dense connections and local feature fusion to achieve efficient single-image super-resolution (SISR) of medical modalities. Moreover, we implement a general-purpose perceptual loss with manual control for image quality improvements of desired aspects by incorporating prior knowledge of medical image segmentation. Compared with state-of-the-art methods on four public medical image datasets, the proposed method achieves the best PSNR scores of 6 modalities among seven modalities. It leads to an average improvement of $+0.09$ dB PSNR with only 38\% parameters of SwinIR. On the other hand, the segmentation-based perceptual loss increases $+0.14$ dB PSNR on average for SOTA methods, including CNNs and vision transformers. Additionally, we conduct comprehensive ablation studies to discuss potential factors for the superior performance of vision transformers over CNNs and the impacts of network and loss function components. The code will be released on GitHub with the paper published.
翻译:超分辨率在医学成像中发挥着不可或缺的作用,因为它提供了实现高空间分辨率和图像质量的替代方法,而没有额外的获取成本。在过去几十年中,深神经网络的迅速发展促进了以新型网络结构、损失功能和评价衡量标准促进超分辨率性能,具体地说,视觉变压器主导着广泛的计算机视觉任务,但在应用这些变压器执行低水平医疗图像处理任务时仍然存在挑战。本文件建议了高效的视觉变压器,该变压器具有残留的密集连接和本地特性聚合,以实现医疗模式的高效单一图像超分辨率(SISR)。此外,我们实施了普通目的的感知损失,同时通过将医学图像分解的先前知识、损失函数和图像质量的改善进行人工控制。与四个公共医学图像数据集中最先进的方法相比,拟议方法在7种模式中达到6种最佳的PSNR分数。它导致SwinIR中仅有38<unk> 参数的美元+0.09元的PSNRRR。另一方面,我们基于分位的图像损失和图像变压值的图像变压率将增加SO++NIS变压的SLM的功能。</s>