Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling

April 11, 2023

Susobhan Ghosh, Raphael Kim, Prasidh Chhabria, Raaz Dwivedi, Predrag Klasnja, Peng Liao, Kelly Zhang, Susan Murphy

From arXiv. The first two authors contributed equally.

There is growing interest in using reinforcement learning (RL) to personalize sequences of treatments in digital health to support users in adopting healthier behaviors. Such sequential decision-making problems involve decisions about when to treat and how to treat based on the user's context (e.g., prior activity level, location, etc.). Online RL is a promising data-driven approach for this problem, as it learns from each user's historical responses and uses that knowledge to personalize these decisions. However, to decide whether the RL algorithm should be included in an "optimized" intervention for real-world deployment, we must assess the data evidence indicating that the RL algorithm is actually personalizing the treatments to its users. Due to the stochasticity in the RL algorithm, one may get a false impression that it is learning in certain states and using this learning to provide specific treatments. We use a working definition of personalization and introduce a resampling-based methodology for investigating whether the personalization exhibited by the RL algorithm is an artifact of the algorithm's stochasticity. We illustrate our methodology with a case study, analyzing the data from a physical activity clinical trial called HeartSteps, which included the use of an online RL algorithm. We demonstrate how our approach enhances data-driven truth-in-advertising of algorithm personalization, both across all users and within specific users in the study.
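The resampling idea in the abstract can be illustrated with a toy sketch. This is not the authors' procedure or the HeartSteps algorithm: the Beta-Bernoulli Thompson sampler, the binary state, the baseline reward probability of 0.5, the "personalization statistic" (difference in treatment frequency between the two states), and all function names are assumptions made only for illustration. The sketch reruns the stochastic algorithm many times in a world where the treatment effect does not depend on state, and asks how often the algorithm's randomness alone produces an apparent state preference at least as large as the one observed.

```python
import random


def run_bandit(effect_by_state, n_steps, rng):
    """Toy Beta-Bernoulli Thompson sampler, learned separately per binary state.

    Returns the fraction of decision times with treatment (action 1) per state.
    """
    # Beta posterior parameters [alpha, beta] for each (state, action) pair.
    post = {s: {a: [1, 1] for a in (0, 1)} for s in (0, 1)}
    treated = {0: 0, 1: 0}
    visits = {0: 0, 1: 0}
    for _ in range(n_steps):
        s = rng.randint(0, 1)  # hypothetical context, drawn uniformly
        draws = {a: rng.betavariate(*post[s][a]) for a in (0, 1)}
        a = max(draws, key=draws.get)  # act on the posterior draws
        # Assumed reward model: baseline 0.5, plus a state-specific effect if treated.
        p_reward = 0.5 + (effect_by_state[s] if a == 1 else 0.0)
        reward = 1 if rng.random() < p_reward else 0
        post[s][a][0 if reward else 1] += 1
        visits[s] += 1
        treated[s] += a
    return {s: treated[s] / max(visits[s], 1) for s in (0, 1)}


def personalization_stat(effect_by_state, n_steps, rng):
    """Illustrative statistic: treatment frequency in state 1 minus state 0."""
    frac = run_bandit(effect_by_state, n_steps, rng)
    return frac[1] - frac[0]


def resampled_null(n_resamples, n_steps, seed):
    """Rerun the algorithm where the treatment effect is identical across states."""
    rng = random.Random(seed)
    no_state_effect = {0: 0.0, 1: 0.0}
    return [personalization_stat(no_state_effect, n_steps, rng)
            for _ in range(n_resamples)]


def fraction_at_least(null_stats, observed):
    """Share of resampled statistics at least as extreme as the observed one."""
    return sum(abs(x) >= abs(observed) for x in null_stats) / len(null_stats)


if __name__ == "__main__":
    # Stand-in for trial data: a simulated user whose treatment works only in state 1.
    observed = personalization_stat({0: 0.0, 1: 0.3}, 200, random.Random(1))
    null_stats = resampled_null(n_resamples=200, n_steps=200, seed=2)
    print("observed statistic:", observed)
    print("fraction of resamples at least as extreme:",
          fraction_at_least(null_stats, observed))
```

A large fraction would suggest that the apparent preference could plausibly arise from algorithm stochasticity alone; a small one would count as evidence of personalization under this toy definition.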
