Recent medical image segmentation models are mostly hybrid, which integrate self-attention and convolution layers into the non-isomorphic architecture. However, one potential drawback of these approaches is that they failed to provide an intuitive explanation of why this hybrid combination manner is beneficial, making it difficult for subsequent work to make improvements on top of them. To address this issue, we first analyze the differences between the weight allocation mechanisms of the self-attention and convolution. Based on this analysis, we propose to construct a parallel non-isomorphic block that takes the advantages of self-attention and convolution with simple parallelization. We name the resulting U-shape segmentation model as UNet-2022. In experiments, UNet-2022 obviously outperforms its counterparts in a range segmentation tasks, including abdominal multi-organ segmentation, automatic cardiac diagnosis, neural structures segmentation, and skin lesion segmentation, sometimes surpassing the best performing baseline by 4%. Specifically, UNet-2022 surpasses nnUNet, the most recognized segmentation model at present, by large margins. These phenomena indicate the potential of UNet-2022 to become the model of choice for medical image segmentation.
翻译:最近的医疗图象分解模型大多是混合的,它们将自留层和分解层纳入非异形结构中。但是,这些方法的一个潜在缺点是,它们未能提供直观的解释,说明这种混合组合方式为什么有益,使得随后的工作难以在它们上面作出改进。为了解决这个问题,我们首先分析自我注意和分解的重量分配机制之间的差别。根据这项分析,我们提议建立一个平行的非异形块,利用自留和分解的优势,简单平行化。我们把由此产生的U形分解模型命名为UNet-2022。在实验中,UNet-2022显然超越了在范围分解任务中的对应方,包括腹部多机分解、自动心脏诊断、神经结构分解和皮肤分解,有时超过最佳的基线4%。具体地说,UNet-2022超越了NUNet,这是目前最公认的分解模型。这些现象显示联合国22选择的模型在大边缘成为医学图象的模型。