Many healthcare decisions involve navigating through a multitude of treatment options in a sequential and iterative manner to find an optimal treatment pathway with the goal of an optimal patient outcome. Such optimization problems may be amenable to reinforcement learning. A reinforcement learning agent could be trained to provide treatment recommendations for physicians, acting as a decision support tool. However, a number of difficulties arise when using RL beyond benchmark environments, such as specifying the reward function, choosing an appropriate state representation and evaluating the learned policy.
翻译:许多保健决定涉及以连续和迭接的方式探索多种治疗选择,以找到最佳治疗途径,达到最佳患者结果的目标,这种优化问题可能有利于强化学习,可以培训强化学习机构为医生提供治疗建议,作为一种决策支持工具,但是,在使用超出基准环境的RL时,会出现一些困难,例如具体规定奖励功能,选择适当的国家代表,评估学习的政策。