To achieve the goal of providing the best possible care to each patient, physicians need to customize treatments for patients with the same diagnosis, especially when treating diseases that can progress further and require additional treatments, such as cancer. Making decisions at multiple stages as a disease progresses can be formalized as a dynamic treatment regime (DTR). Most of the existing optimization approaches for estimating dynamic treatment regimes including the popular method of Q-learning were developed in a frequentist context. Recently, a general Bayesian machine learning framework that facilitates using Bayesian regression modeling to optimize DTRs has been proposed. In this article, we adapt this approach to censored outcomes using Bayesian additive regression trees (BART) for each stage under the accelerated failure time modeling framework, along with simulation studies and a real data example that compare the proposed approach with Q-learning. We also develop an R wrapper function that utilizes a standard BART survival model to optimize DTRs for censored outcomes. The wrapper function can easily be extended to accommodate any type of Bayesian machine learning model.
翻译:为了实现向每个病人提供尽可能最好的护理的目标,医生需要将诊断结果相同的病人的治疗,特别是治疗能够进一步取得进展并需要更多治疗的疾病,如癌症。作为疾病进展在多个阶段作出决定可以正式确定为动态治疗制度(DTR)。现有用于估计动态治疗制度(包括普及的Q-学习方法)的优化方法大多是在常年环境下制定的。最近,提出了一个通用的巴伊西亚机器学习框架,便利使用巴伊西亚回归模型优化DTR。在本条中,我们将这一方法适应于在加速失败时间模型框架下每个阶段采用Bayesian添加的回归树(BART)的检查结果,同时进行模拟研究,并树立将拟议方法与Q-学习进行比较的真实数据范例。我们还开发了一个R包装功能,利用标准BART生存模型优化DTR,以取得经审查的结果。包装功能可以很容易扩展,以适应任何类型的巴伊西亚机器学习模式。