Over the past few years, the rapid development of deep learning technologies for computer vision has greatly promoted the performance of medical image segmentation (MedISeg). However, the recent MedISeg publications usually focus on presentations of the major contributions (e.g., network architectures, training strategies, and loss functions) while unwittingly ignoring some marginal implementation details (also known as "tricks"), leading to a potential problem of the unfair experimental result comparisons. In this paper, we collect a series of MedISeg tricks for different model implementation phases (i.e., pre-training model, data pre-processing, data augmentation, model implementation, model inference, and result post-processing), and experimentally explore the effectiveness of these tricks on the consistent baseline models. Compared to paper-driven surveys that only blandly focus on the advantages and limitation analyses of segmentation models, our work provides a large number of solid experiments and is more technically operable. With the extensive experimental results on both the representative 2D and 3D medical image datasets, we explicitly clarify the effect of these tricks. Moreover, based on the surveyed tricks, we also open-sourced a strong MedISeg repository, where each of its components has the advantage of plug-and-play. We believe that this milestone work not only completes a comprehensive and complementary survey of the state-of-the-art MedISeg approaches, but also offers a practical guide for addressing the future medical image processing challenges including but not limited to small dataset learning, class imbalance learning, multi-modality learning, and domain adaptation. The code has been released at: https://github.com/hust-linyi/MedISeg
翻译:在过去几年里,计算机视觉的深层学习技术的迅速发展极大地促进了医学图像分割(MedISeg)的绩效,然而,最近MedISeg的出版物通常侧重于主要贡献的表述(例如网络结构、培训战略和损失功能),而无意中忽略了某些边际执行细节(又称“tricks”),导致了不公平的实验结果比较的潜在问题。在本文中,我们收集了一系列针对不同模型执行阶段(即培训前模型、数据预处理、数据增强、模型实施、模型的分类和结果处理后的结果)的MedISeg技巧,并实验性地探索这些主要贡献(例如网络结构、培训战略和损失功能)的展示(例如网络结构、培训战略和损失功能)的展示,同时无意中忽略了某些边际执行细节的细节细节(又称为“tricks” ),导致大量可靠的实验和技术上的可操作性实验。在代表2D和3D公布的医疗图像元数据集方面,我们明确澄清了这些技巧的效果。此外,在调查技巧的细微处理方法上,我们还相信每个医学图案的图中都有了一种开放的模型的模型,我们也相信每个模型的模型的模型的模型的模型的学习。