Convolutional neural networks (CNNs) are increasingly being used to automate segmentation of organs-at-risk in radiotherapy. Since large sets of highly curated data are scarce, we investigated how much data is required to train accurate and robust head and neck auto-segmentation models. For this, an established 3D CNN was trained from scratch with different sized datasets (25-1000 scans) to segment the brainstem, parotid glands and spinal cord in CTs. Additionally, we evaluated multiple ensemble techniques to improve the performance of these models. The segmentations improved with training set size up to 250 scans and the ensemble methods significantly improved performance for all organs. The impact of the ensemble methods was most notable in the smallest datasets, demonstrating their potential for use in cases where large training datasets are difficult to obtain.
翻译:卷积神经网络(CNN)越来越多地用于自动化放射治疗中的危及器官分割。由于缺乏大量高度精选的数据集,因此我们研究了需要多少数据来训练准确和稳健的头颈自动分割模型。为此,使用不同大小的数据集(25-1000个扫描)从头开始训练了一个已建立的3D CNN,以在CT中分割脑干、腮腺和脊髓。此外,我们评估了多种集成技术以提高这些模型的性能。随着训练集大小的增加,分割结果提高,最多可达到250个扫描,而集成方法显著提高了所有器官的性能。集成方法在最小的数据集中影响最为明显,说明它们在难以获得大型训练数据集的情况下可能会发挥作用。