Transformer, a model comprising attention-based encoder-decoder architecture, have gained prevalence in the field of natural language processing (NLP) and recently influenced the computer vision (CV) space. The similarities between computer vision and medical imaging, reviewed the question among researchers if the impact of transformers on computer vision be translated to medical imaging? In this paper, we attempt to provide a comprehensive and recent review on the application of transformers in medical imaging by; describing the transformer model comparing it with a diversity of convolutional neural networks (CNNs), detailing the transformer based approaches for medical image classification, segmentation, registration and reconstruction with a focus on the image modality, comparing the performance of state-of-the-art transformer architectures to best performing CNNs on standard medical datasets.
翻译:由关注型编码器-解码器结构构成的模型变换器在自然语言处理领域已占上风,最近对计算机视觉空间产生了影响。计算机视觉与医学成像之间的相似之处,审查了研究人员之间如果将变压器对计算机视觉的影响转化为医学成像的问题。在本文中,我们试图通过以下方式对变压器在医学成像中的应用进行全面和近期审查:描述变压器模型,将其与革命神经网络的多样性进行比较,详细说明基于变压器的医学图像分类、分解、注册和重建方法,重点是图像模式,比较最先进的变压器结构的性能,以在标准医学数据集上最佳地运行CNN。