With the unprecedented developments in deep learning, many methods are proposed and have achieved great success for medical image segmentation. However, unlike segmentation of natural images, most medical images such as MRI and CT are volumetric data. In order to make full use of volumetric information, 3D CNNs are widely used. However, 3D CNNs suffer from higher inference time and computation cost, which hinders their further clinical applications. Additionally, with the increased number of parameters, the risk of overfitting is higher, especially for medical images where data and annotations are expensive to acquire. To issue this problem, many 2.5D segmentation methods have been proposed to make use of volumetric spatial information with less computation cost. Despite these works lead to improvements on a variety of segmentation tasks, to the best of our knowledge, there has not previously been a large-scale empirical comparison of these methods. In this paper, we aim to present a review of the latest developments of 2.5D methods for volumetric medical image segmentation. Additionally, to compare the performance and effectiveness of these methods, we provide an empirical study of these methods on three representative segmentation tasks involving different modalities and targets. Our experimental results highlight that 3D CNNs may not always be the best choice. Besides, although all these 2.5D methods can bring performance gains to 2D baseline, not all the methods hold the benefits on different datasets. We hope the results and conclusions of our study will prove useful for the community on exploring and developing efficient volumetric medical image segmentation methods.
翻译:由于深层学习史无前例的发展,提出了许多方法,在医学图像分割方面取得了巨大成功,然而,与自然图像分割不同,MRI和CT等大多数医学图像是量量数据。为了充分利用量信息,广泛使用了3DCNN;然而,3DCNN系统由于深度学习的推论时间和计算成本较高,阻碍了其进一步的临床应用。此外,随着参数数量的增加,过分搭配的风险更大,特别是对于数据和说明昂贵的医学图像而言,过分搭配的风险更大。为解决这一问题,提出了许多2.5D分解方法,以较少的计算成本利用体积空间信息。尽管这些工作导致在各种分解任务上有所改进,但根据我们的知识,3D有广泛的经验性比较了这些方法,从而阻碍了其进一步的临床应用。此外,为了比较这些方法的绩效和有效性,我们将对这些方法进行实验性研究,涉及不同模式和指标的三种有代表性的分解任务,尽管这些方法导致了各种分解任务,但是我们的实验性结果并没有使这些基准方法产生全部的希望。