Background and objective: In this paper, a modified U-Net based framework is presented, which leverages techniques from Squeeze-and-Excitation (SE) block, Atrous Spatial Pyramid Pooling (ASPP) and residual learning for accurate and robust liver CT segmentation, and the effectiveness of the proposed method was tested on two public datasets LiTS17 and SLiver07. Methods: A new network architecture called SAR-U-Net was designed. Firstly, the SE block is introduced to adaptively extract image features after each convolution in the U-Net encoder, while suppressing irrelevant regions, and highlighting features of specific segmentation task; Secondly, ASPP was employed to replace the transition layer and the output layer, and acquire multi-scale image information via different receptive fields. Thirdly, to alleviate the degradation problem, the traditional convolution block was replaced with the residual block and thus prompt the network to gain accuracy from considerably increased depth. Results: In the LiTS17 experiment, the mean values of Dice, VOE, RVD, ASD and MSD were 95.71, 9.52, -0.84, 1.54 and 29.14, respectively. Compared with other closely related 2D-based models, the proposed method achieved the highest accuracy. In the experiment of the SLiver07, the mean values of Dice, VOE, RVD, ASD and MSD were 97.31, 5.37, -1.08, 1.85 and 27.45, respectively. Compared with other closely related models, the proposed method achieved the highest segmentation accuracy except for the RVD. Conclusion: The proposed model enables a great improvement on the accuracy compared to 2D-based models, and its robustness in circumvent challenging problems, such as small liver regions, discontinuous liver regions, and fuzzy liver boundaries, is also well demonstrated and validated.
翻译:在本文中,介绍了一个基于U-Net的修改框架,该框架利用了来自Squeeze-Expuration(SE)块、Astrous Space Pyramid Compluning(ASPP)和残留学习的技术,以准确和稳健的肝脏CT分割,并在两个公共数据集LITS17和SLiver07中测试了拟议方法的有效性。方法:设计了一个称为SAR-U-Net的新网络架构。首先,SE区在U-Net编码器每次变速后,通过适应性提取图像特征,同时压制不相关的区域,并突出具体稳定的分离任务;第二,ASPP用来取代过渡层和产出层,通过不同的接收场获取多尺度图像信息。 第三,传统变速区被替换为残余区,从而促使网络从深度大幅提高获得准确性。 在LTS17号试验中,基于Dice、VE、RVDD、ASD和MSD的平均值值值,在95.71、9.52、ASD和MSD的精确度分析中分别与Seral-L相关的模型、1.54和2。