Recent studies have witnessed the effectiveness of 3D convolutions on segmenting volumetric medical images. Compared with the 2D counterparts, 3D convolutions can capture the spatial context in three dimensions. Nevertheless, models employing 3D convolutions introduce more trainable parameters and are more computationally complex, which may lead easily to model overfitting especially for medical applications with limited available training data. This paper aims to improve the effectiveness and efficiency of 3D convolutions by introducing a novel Group Shift Pointwise Convolution (GSP-Conv). GSP-Conv simplifies 3D convolutions into pointwise ones with 1x1x1 kernels, which dramatically reduces the number of model parameters and FLOPs (e.g. 27x fewer than 3D convolutions with 3x3x3 kernels). Na\"ive pointwise convolutions with limited receptive fields cannot make full use of the spatial image context. To address this problem, we propose a parameter-free operation, Group Shift (GS), which shifts the feature maps along with different spatial directions in an elegant way. With GS, pointwise convolutions can access features from different spatial locations, and the limited receptive fields of pointwise convolutions can be compensated. We evaluate the proposed methods on two datasets, PROMISE12 and BraTS18. Results show that our method, with substantially decreased model complexity, achieves comparable or even better performance than models employing 3D convolutions.
翻译:与 2D 对等方相比, 3D 对立方可以从三个维度中捕捉空间环境。 然而, 3D 相联模式引入了更多的可培训参数, 并且更具有计算性的复杂性, 这可能会很容易导致模型的超配, 特别是医疗应用中, 现有培训数据有限。 本文旨在通过引入新型的集团变换点相联( GSP- Conv) 来提高3D 相联的效益和效率。 普惠制对等将 3D 相联以 1x1x1 内核为点, 大大降低了模型参数和FLOP的数量( 例如, 3D 3x3 内核) 。 使用有限容域的点相联组合无法充分利用空间图像环境。 为了解决这一问题, 我们建议采用无参数操作 Group Shift (GS ), 将地图与不同的空间方向相交替, 甚为GS, 点相联相近的组合和FLOP 能够从不同的空间位置上获取更好的可比较性模型, 。