We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code is freely available at https://github.com/ababier/open-kbp-opt.
翻译:我们为制定基于知识的放射治疗规划(KBP)计划优化模型建立了一个开放框架,我们的框架包括为100名头部和颈部癌症患者制定的参考计划,以及不同研究团体在OpenKBP大挑战期间开发的19个KBP常规模型的高质量剂量预测。剂量预测是对四个优化模型的投入,以形成76个独特的KBP管道,产生7600个计划。预测和计划与参考计划进行了比较,其方式是:剂量评分,这是在剂量方面达到的平均绝对 voxel-byvoxal差异;剂量量直方图(DVH)标准的偏差;临床规划标准的满意度。我们还进行了理论调查,以证明我们的剂量模拟模型及其KBP输油管道之间的按级比值对比范围为0.50至0.62。预测质量与计划质量基本正比。此外,与输入预测相比,KBPBP生成的计划(P < 0.05;1. WilcopHP) 标准偏差;以及临床规划标准的频率为18个国际标准,该标准最终达到23个国际标准。