In this paper, we consider the problem of learning online to manage Demand Response (DR) resources. A typical DR mechanism requires the DR manager to assign a baseline to the participating consumer, where the baseline is an estimate of the counterfactual consumption of the consumer had it not been called to provide the DR service. A challenge in estimating the baseline is the incentive the consumer has to inflate the baseline estimate. We consider the problem of learning online to estimate the baseline and to optimize the operating costs over a period of time under such incentives. We propose an online learning scheme that employs least squares for estimation, with a perturbation to the reward price (for the DR services or load curtailment) designed to balance the exploration and exploitation trade-off that arises in online learning. We show that our proposed scheme achieves a very low regret of $\mathcal{O}\left((\log{T})^2\right)$ with respect to the optimal operating cost over $T$ days of the DR program with full knowledge of the baseline, and that it is individually rational for the consumers to participate. Our scheme significantly outperforms the averaging-type approach, which only achieves $\mathcal{O}(T^{1/3})$ regret.
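To make the idea concrete, the following is a minimal numerical sketch of the price-perturbation principle described above, not the paper's actual algorithm. It assumes a hypothetical linear baseline model $b_t = \theta^\top x_t$ and a hypothetical linear consumer response to the reward price; the $1/\sqrt{t}$ perturbation schedule is chosen purely for illustration. The point it demonstrates is that randomizing the reward price makes the baseline parameters identifiable by least squares even though consumption is always observed under some price.

```python
import numpy as np

rng = np.random.default_rng(0)

T = 200        # horizon: days of the DR program
d = 3          # dimension of hypothetical covariates (e.g., weather features)
theta = rng.normal(size=d)   # true baseline parameters, unknown to the manager
p_star = 1.0                 # nominal reward price for load curtailment

X, y = [], []                # running data set for the least-squares fit
theta_hat = np.zeros(d)

for t in range(1, T + 1):
    x_t = rng.normal(size=d)             # today's covariates
    # Perturb the reward price to induce exploration; the decay rate here
    # is an illustrative choice, not the schedule analyzed in the paper.
    p_t = p_star + rng.choice([-1.0, 1.0]) / np.sqrt(t)

    baseline_t = theta @ x_t + 0.1 * rng.normal()   # counterfactual consumption
    # Assumed linear price response: the consumer curtails 0.5 units of load
    # per unit of reward price (hypothetical response model).
    consumption_t = baseline_t - 0.5 * p_t

    # The manager observes (x_t, p_t, consumption_t); since
    # consumption = theta^T x - 0.5 p, regressing consumption on (x, p)
    # recovers the baseline parameters in the first d coefficients.
    X.append(np.concatenate([x_t, [p_t]]))
    y.append(consumption_t)
    coef, *_ = np.linalg.lstsq(np.array(X), np.array(y), rcond=None)
    theta_hat = coef[:d]

print("estimated baseline params:", theta_hat)
print("true baseline params:     ", theta)
```

Without the perturbation, every day's price would equal $p^*$ and the price column of the regression would be constant, so the consumer's price response could not be separated from the baseline; the decaying perturbation supplies exactly the exploration that the scheme trades off against exploitation.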