In this work we focus on automatic segmentation of multiple anatomical structures in (whole body) CT images. Many segmentation algorithms exist for this task. However, in most cases they suffer from 3 problems: 1. They are difficult to use (the code and data is not publicly available or difficult to use). 2. They do not generalize (often the training dataset was curated to only contain very clean images which do not reflect the image distribution found during clinical routine), 3. The algorithm can only segment one anatomical structure. For more structures several algorithms have to be used which increases the effort required to set up the system. In this work we publish a new dataset and segmentation toolkit which solves all three of these problems: In 1204 CT images we segmented 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) covering a majority of relevant classes for most use cases. We show an improved workflow for the creation of ground truth segmentations which speeds up the process by over 10x. The CT images were randomly sampled from clinical routine, thus representing a real world dataset which generalizes to clinical application. The dataset contains a wide range of different pathologies, scanners, sequences and sites. Finally, we train a segmentation algorithm on this new dataset. We call this algorithm TotalSegmentator and make it easily available as a pretrained python pip package (pip install totalsegmentator). Usage is as simple as TotalSegmentator -i ct.nii.gz -o seg and it works well for most CT images. The code is available at https://github.com/wasserth/TotalSegmentator and the dataset at https://doi.org/10.5281/zenodo.6802613.
翻译:在这项工作中,我们侧重于(整体体)CT图像中多个解剖结构的自动分解。 许多解剖算法存在用于此任务。 但是, 在多数情况下, 它们都存在3个问题 : 1. 它们难以使用( 代码和数据无法公开或难以使用 )。 2. 它们不一般化( 培训数据集通常被整理成仅包含非常干净的图像, 无法反映临床常规中发现的图像分布 ) 。 算法只能包含一个解剖结构 。 对于更多的结构来说, 需要使用多个解剖结构来增加建立系统所需的努力 。 但是, 在这项工作中, 我们公布一个新的解析和分解工具 。 在 1204 CT 图像中, 我们分解了104个解结构( 27个器官、 59 骨头、 10 肌肉、 8 船体), 覆盖了大部分相关案例的相关课程。 我们显示为创建地面解析器预解析器的工作流程有所改善, 加速了进程10x。 CT 图像是从临床常规中随机取样的, 代表了一个真实的完整的全局数据集 Squrealal- slavealalal 。