Dynamic treatment regimes (DTRs) aim at tailoring individualized sequential treatment rules that maximize cumulative beneficial outcomes by accommodating patient's heterogeneity into decision making. For many chronic diseases including type 2 diabetes mellitus (T2D), treatments are usually multifaceted in the sense that the aggressive treatments with higher expected reward are also likely to elevate the risk of acute adverse events. In this paper, we propose a new weighted learning framework, namely benefit-risk dynamic treatment regimes (BR-DTRs), to address the benefit-risk trade-off. The new framework relies on a backward learning procedure by restricting the induced risk of the treatment rule to be no larger than a pre-specified risk constraint at each treatment stage. Computationally, the estimated treatment rule solves a weighted support vector machine problem with a modified smooth constraint. Theoretically, we show that the proposed DTRs are Fisher consistent and we further obtain the convergence rates for both the value and risk functions. Finally, the performance of the proposed method is demonstrated via extensive simulation studies and application to a real study for T2D patients.
翻译:动态治疗制度(DTRs)旨在调整个性化的连续治疗规则,通过照顾病人的异质性,在决策中实现累积效益最大化;对于许多慢性疾病,包括2型糖尿病(T2D),治疗通常具有多面性,因为预期报酬较高的攻击性治疗也有可能增加严重不利事件的风险;在本文件中,我们提议一个新的加权学习框架,即利益风险动态治疗制度(BR-DTRs),以解决利益风险权衡问题;新框架依靠后退学习程序,限制治疗规则的诱发风险,限制其在每个治疗阶段不超过预先确定的风险限制。估计治疗规则解决加权支持矢量机问题,并修正了平稳限制。理论上,我们表明拟议的DTRs是渔业的,我们进一步获得了价值和风险功能的趋同率。最后,通过广泛的模拟研究和应用对T2D病人的实际研究来证明拟议方法的绩效。