In recent years, large amounts of electronic health records (EHRs) concerning chronic diseases, such as cancer, diabetes, and mental disease, have been collected to facilitate medical diagnosis. Modeling the dynamic properties of EHRs related to chronic diseases can be efficiently done using dynamic treatment regimes (DTRs), which are a set of sequential decision rules. While Reinforcement learning (RL) is a widely used method for creating DTRs, there is ongoing research in developing RL algorithms that can effectively handle large amounts of data. In this paper, we present a novel approach, a distributed Q-learning algorithm, for generating DTRs. The novelties of our research are as follows: 1) From a methodological perspective, we present a novel and scalable approach for generating DTRs by combining distributed learning with Q-learning. The proposed approach is specifically designed to handle large amounts of data and effectively generate DTRs. 2) From a theoretical standpoint, we provide generalization error bounds for the proposed distributed Q-learning algorithm, which are derived within the framework of statistical learning theory. These bounds quantify the relationships between sample size, prediction accuracy, and computational burden, providing insights into the performance of the algorithm. 3) From an applied perspective, we demonstrate the effectiveness of our proposed distributed Q-learning algorithm for DTRs by applying it to clinical cancer treatments. The results show that our algorithm outperforms both traditional linear Q-learning and commonly used deep Q-learning in terms of both prediction accuracy and computation cost.
翻译:近年来,收集了大量关于癌症、糖尿病和精神疾病等慢性病的电子健康记录,以便利医学诊断;利用一套顺序决定规则,即动态治疗制度(DTRs),可以有效地模拟与慢性病有关的电子健康记录动态特性;虽然强化学习(RL)是广泛用来创建DTR的一种方法,但正在研究如何开发能够有效处理大量数据的RL算法;在本文件中,我们提出了一种新颖的方法,即用于生成DTR的分布式Q学习算法。我们研究的新颖之处如下:(1)从方法角度,我们提出了一种创新和可扩展的方法,通过将分布式学习与Q学习相结合,生成DTRs动态特性特性特性特性特性;拟议方法具体旨在处理大量数据和有效地生成DTRs。(2) 从理论角度,我们为拟议的分布式的Q-学习算法提供了一般误差界限,这是在统计学习理论框架内推算的。这些新颖的Q-我们研究的新颖的样本规模、预测准确性和计算方法之间的关联性关系,从传统的计算方法的角度,我们提出了一种新式的计算方法,从应用的计算方法,从分析质量到我们所应用的计算结果的计算结果,用到我们所应用的计算结果的计算结果,从运用的常规-分析过程的进度分析结果,用到我们用来显示的进度分析结果的进度分析结果,用来显示我们用来显示我们所研的进度的进度的进度。