B2RL:用于建筑批量强化学习的开放源数据集 (B2RL: An open-source Dataset for Building Batch Reinforcement Learning) - 专知论文

会员服务 ·

0

Learning · 数据集 · Buffer（公司） · 知识 (knowledge) · 强化学习 ·

2022 年 9 月 30 日

B2RL: An open-source Dataset for Building Batch Reinforcement Learning

翻译：B2RL:用于建筑批量强化学习的开放源数据集

Hsin-Yu Liu,Xiaohan Fu,Bharathan Balaji,Rajesh Gupta,Dezhi Hong

Batch reinforcement learning (BRL) is an emerging research area in the RL community. It learns exclusively from static datasets (i.e. replay buffers) without interaction with the environment. In the offline settings, existing replay experiences are used as prior knowledge for BRL models to find the optimal policy. Thus, generating replay buffers is crucial for BRL model benchmark. In our B2RL (Building Batch RL) dataset, we collected real-world data from our building management systems, as well as buffers generated by several behavioral policies in simulation environments. We believe it could help building experts on BRL research. To the best of our knowledge, we are the first to open-source building datasets for the purpose of BRL learning.

翻译：批量强化学习( BRL) 是RL 社区中一个新兴的研究领域。它只从静态数据集( 即重放缓冲) 中学习, 而不与环境互动。在离线设置中, 现有的重播经验被用作 BRL 模型的先前知识, 以找到最佳政策。因此, 生成重播缓冲对于 BRL 模型基准至关重要。在 B2RL ( 建设批量RL) 数据集中, 我们从我们的建筑管理系统中收集了真实世界的数据, 以及模拟环境中的若干行为政策生成的缓冲。我们认为它可以帮助培养 BRL 研究专家。根据我们的知识, 我们是第一个为 BRL 学习目的开源构建数据集的。

0

相关内容

Learning

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

荧光/磁性纳米球用于循环肿瘤细胞的高灵敏检测

国家自然科学基金

0+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

多脉冲强流电子束的能量累积效应对多相Al-Co-Ce合金非晶态转变过程的影响机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

磁性多孔聚合物用于贵金属纳米污染物的分离富集与分析

国家自然科学基金

0+阅读 · 2015年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

热电磁流动和热电磁力对液/固界面稳定性和枝晶生长的影响

国家自然科学基金

0+阅读 · 2012年12月31日

煤粉炉增钙脱硫粉煤灰Q相-3CaO？3Al2O3？CaSO4系列矿物生成机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

碳酸盐岩区硫化物尾矿中重金属赋存状态研究

国家自然科学基金

0+阅读 · 2012年12月31日

羰基化聚吡咯/纳米孔TiO2复合材料光电转换机理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

添加新型富钡相RE242制备高超导性能REBCO块材

国家自然科学基金

0+阅读 · 2011年12月31日

GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月7日

Reward-Predictive Clustering

Arxiv

0+阅读 · 2022年11月7日

Collaborative Video Analytics on Distributed Edges with Multiagent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月6日

Reliable Off-policy Evaluation for Reinforcement Learning

Arxiv

0+阅读 · 2022年11月3日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Reinforcement Learning based Air Combat Maneuver Generation

Reinforcement Learning based Air Combat Maneuver Generation

Arxiv

91+阅读 · 2022年1月14日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Arxiv

20+阅读 · 2020年3月10日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

VIP会员

文章信息

相关主题

Buffer（公司）

知识 (knowledge)

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《分析与预测陆军战斗体能测试表现：统计与机器学习方法》2025最新137页

《军事行动中的人机协同共同学习》2025最新文献

代理式人工智能时代的决策优势

《F/A-18机队替换中队仿真模型的设计与分析》2025最新73页

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

GRIMGEP: Learning Progress for Robust Goal Sampling in Visual Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月7日

Reward-Predictive Clustering

Arxiv

0+阅读 · 2022年11月7日

Collaborative Video Analytics on Distributed Edges with Multiagent Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月6日

Reliable Off-policy Evaluation for Reinforcement Learning

Arxiv

0+阅读 · 2022年11月3日

Reinforcement Learning on Graph: A Survey

Arxiv

67+阅读 · 2022年4月13日

Reinforcement Learning based Air Combat Maneuver Generation

Reinforcement Learning based Air Combat Maneuver Generation

Arxiv

91+阅读 · 2022年1月14日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey

Arxiv

20+阅读 · 2020年3月10日

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning

Arxiv

26+阅读 · 2020年2月10日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

相关基金

荧光/磁性纳米球用于循环肿瘤细胞的高灵敏检测

国家自然科学基金

0+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

多脉冲强流电子束的能量累积效应对多相Al-Co-Ce合金非晶态转变过程的影响机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

磁性多孔聚合物用于贵金属纳米污染物的分离富集与分析

国家自然科学基金

0+阅读 · 2015年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

热电磁流动和热电磁力对液/固界面稳定性和枝晶生长的影响

国家自然科学基金

0+阅读 · 2012年12月31日

煤粉炉增钙脱硫粉煤灰Q相-3CaO？3Al2O3？CaSO4系列矿物生成机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

碳酸盐岩区硫化物尾矿中重金属赋存状态研究

国家自然科学基金

0+阅读 · 2012年12月31日

羰基化聚吡咯/纳米孔TiO2复合材料光电转换机理的研究

国家自然科学基金

0+阅读 · 2011年12月31日

添加新型富钡相RE242制备高超导性能REBCO块材

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员