Deep learning (DL) is the state-of-the-art methodology in various medical image segmentation tasks. However, it requires relatively large amounts of manually labeled training data, which may be infeasible to generate in some applications. In addition, DL methods have relatively poor generalizability to out-of-sample data. Multi-atlas segmentation (MAS), on the other hand, has promising performance using limited amounts of training data and good generalizability. A hybrid method that integrates the high accuracy of DL and good generalizability of MAS is highly desired and could play an important role in segmentation problems where manually labeled data is hard to generate. Most of the prior work focuses on improving single components of MAS using DL rather than directly optimizing the final segmentation accuracy via an end-to-end pipeline. Only one study explored this idea in binary segmentation of 2D images, but it remains unknown whether it generalizes well to multi-class 3D segmentation problems. In this study, we propose a 3D end-to-end hybrid pipeline, named deep label fusion (DLF), that takes advantage of the strengths of MAS and DL. Experimental results demonstrate that DLF yields significant improvements over conventional label fusion methods and U-Net, a direct DL approach, in the context of segmenting medial temporal lobe subregions using 3T T1-weighted and T2-weighted MRI. Further, when applied to an unseen similar dataset acquired in 7T, DLF maintains its superior performance, which demonstrates its good generalizability.
翻译:深度学习(DL)是各种医学图像分割任务中最先进的方法。然而,它需要大量手工贴标签的培训数据,在某些应用中可能无法生成。此外,DL方法对于标外数据具有相对较差的通用性。多分解(MAS)则使用数量有限的培训数据和良好的一般性能,取得了有希望的性能。一种混合方法将DL的高精确度和高超质量质量的数学分解问题结合起来,在手工贴标签数据难以生成的分解问题中可以发挥重要作用。以前的工作大多侧重于使用DL改进MAS的单一部件,而不是直接通过端对端管道优化最终的分解准确性。只有一项研究在2D图像的双分解中探讨了这一想法,但目前还不清楚它是否概括了多级的3D分解问题。在这项研究中,我们建议采用3D-端混合管道,将深度标签在T-网络上很难生成数据。以前的工作重点是用DLF1的高级分解方法,从而展示了其常规分解方法的优势。