Convolutional Neural Networks (CNNs) have achieved promising results in medical image segmentation. However, CNNs require lots of training data and are incapable of handling pose and deformation of objects. Furthermore, their pooling layers tend to discard important information such as positions as well as CNNs are sensitive to rotation and affine transformation. Capsule network is a recent new architecture that has achieved better robustness in part-whole representation learning by replacing pooling layers with dynamic routing and convolutional strides, which has shown potential results on popular tasks such as digit classification and object segmentation. In this paper, we propose a 3D encoder-decoder network with Convolutional Capsule Encoder (called 3DConvCaps) to learn lower-level features (short-range attention) with convolutional layers while modeling the higher-level features (long-range dependence) with capsule layers. Our experiments on multiple datasets including iSeg-2017, Hippocampus, and Cardiac demonstrate that our 3D 3DConvCaps network considerably outperforms previous capsule networks and 3D-UNets. We further conduct ablation studies of network efficiency and segmentation performance under various configurations of convolution layers and capsule layers at both contracting and expanding paths.
翻译:然而,CNN需要大量培训数据,无法处理表面和变形物体。此外,它们的集合层往往丢弃重要信息,如位置和CNN等,对旋转和亲吻转变十分敏感。Capsule网络是一个新的结构,它通过用动态路线和进化步伐取代集合层,从而在部分整体学习方面取得了更好的稳健性,从而用动态路线和进化步伐来取代集合层,这显示了在数字分类和对象分割等流行任务方面的潜在结果。在本文中,我们提议与3D Convolual Capule Encoder(称为3D ConConCapts)建立一个3D 编码解密器网络,以学习与革命层相比较低层次的特征(近距离关注),同时用胶囊层来模拟更高层次的特征(长距离依赖性),我们在包括iSeg-2017、Hippocampus和Cardiac等多个数据集的实验表明,我们的3D Conv Capts网络大大超越了先前的胶囊化网络和3D-Conapts-Dlational contraculation 的升级网络和在不断扩展的网络中进行一个不断扩展和升级的系统化的升级和升级和升级和升级的网络的升级的功能研究。