In recent years, reinforcement learning (RL) has acquired a prominent position in the space of health-related sequential decision-making, becoming an increasingly popular tool for delivering adaptive interventions (AIs). However, despite potential benefits, its real-life application is still limited, partly due to a poor synergy between the methodological and the applied communities. In this work, we provide the first unified survey on RL methods for learning AIs, using the common methodological umbrella of RL to bridge the two AI areas of dynamic treatment regimes and just-in-time adaptive interventions in mobile health. We outline similarities and differences between these two AI domains and discuss their implications for using RL. Finally, we leverage our experience in designing case studies in both areas to illustrate the tremendous collaboration opportunities between statistical, RL, and healthcare researchers in the space of AIs.
翻译:近年来,强化学习(RL)在健康相关的序贯决策中占据了重要地位,越来越成为提供自适应干预(AIs)的流行工具。然而,尽管具有潜在的好处,但其现实应用仍然受到限制,部分原因是方法论和应用社区之间的协同作用不佳。在这项工作中,我们提供了第一份关于使用RL方法学习AIs的统一调查,使用RL的共同方法学伞来链接两个AI领域:动态治疗方案和移动健康中的即时自适应干预。我们概述了这两个AI领域的相似之处和差异,并讨论了它们对使用RL的影响。最后,我们利用在这两个领域设计案例研究的经验,说明了统计、RL和医疗研究人员在AI领域中合作的巨大机会。