While reinforcement learning (RL) has proven to be the approach of choice for tackling many complex problems, it remains challenging to develop and deploy RL agents in real-life scenarios successfully. This paper presents pH-RL (personalization in e-Health with RL) a general RL architecture for personalization to bring RL to health practice. pH-RL allows for various levels of personalization in health applications and allows for online and batch learning. Furthermore, we provide a general-purpose implementation framework that can be integrated with various healthcare applications. We describe a step-by-step guideline for the successful deployment of RL policies in a mobile application. We implemented our open-source RL architecture and integrated it with the MoodBuster mobile application for mental health to provide messages to increase daily adherence to the online therapeutic modules. We then performed a comprehensive study with human participants over a sustained period. Our experimental results show that the developed policies learn to select appropriate actions consistently using only a few days' worth of data. Furthermore, we empirically demonstrate the stability of the learned policies during the study.
翻译:强化学习(RL)已被证明是解决许多复杂问题的首选方法,但在现实生活中成功开发和部署RL代理物方面仍具有挑战性。本文介绍了PH-RL(电子保健与RL的个性化)个人化的总体RL结构,以将RL引入健康实践。pH-RL允许在健康应用中实现不同程度的个人化,并允许在线和批量学习。此外,我们提供了一个可以与各种保健应用相结合的通用实施框架。我们描述了在移动应用中成功部署RL政策的分步骤指南。我们实施了开放源的RL结构,并将其与MoodBuster心理健康移动应用结合起来,以提供信息,提高日常对在线治疗模块的遵守程度。我们随后与人类参与者持续进行了一项全面研究。我们的实验结果表明,所制定的政策学会了选择适当行动,仅使用几天有价值的数据。此外,我们从经验上展示了研究期间所学到的政策的稳定性。