【微软2022】强化学习全球开源节挑战项目，报名截止 2022.4.4，申请成功奖励1万美金。 - 专知

会员服务 ·

0

【微软2022】强化学习全球开源节挑战项目，报名截止 2022.4.4，申请成功奖励1万美金。

2022 年 3 月 15 日 深度强化学习实验室

来源：微软

编辑：DeepRLHub

The Reinforcement Learning (RL) Open Source Fest is a global online program focused on introducing students to open-source reinforcement learning programs and software development while working alongside researchers, data scientists, and engineers on the Real World Reinforcement Learning team at Microsoft Research NYC. Students will work on a four-month research programming project for either a Summer (May-August 2022) or Fall session (September – December 2022). Accepted students will receive a $10,000 USD stipend. Selected students will receive their stipend payment at the beginning of their session. Microsoft sends the payment directly to a student’s academic institution, which then disperses funds according to the institution’s guidelines.

Our goal is to bring together a diverse group of students from around the world to collectively solve open-source reinforcement learning problems and advance the state-of-the-art research and development alongside the RL community while providing open-source code written and released to benefit all.

At the end of the program, students will present each of their projects to the Microsoft Research Real World Reinforcement Learning team online.

Open-source projects

Vowpal Wabbit (VW) is an open-source machine learning library created by John Langford and developed by Microsoft Research with the help of many contributors. It is a fast, flexible, online, and active learning solution that empowers people to solve complex interactive machine learning problems, with a large focus on contextual bandits and reinforcement learning. It is a vehicle for both research prototyping and driving bleeding edge algorithms to production. RL OS Fest is all about open-source projects in the Vowpal Wabbit ecosystem.

项目列表：

https://vowpalwabbit.org/rlos/2022/projects

Eligibility

To be eligible for the program, students must be enrolled in or accepted into an accredited institution including colleges, universities, Master programs, PhD programs, and undergraduate programs.

Student responsibilities during the program

Submit quality work: code compiles, has unit tests and documentation, and passes code review
Regularly communicate work completed, what you intend to do next, and blockers
Re-evaluate project tasks if you’re significantly ahead or behind schedule
Regular check-ins with your mentor/collaborator
Listen and respond to feedback
Pro-active learning

Program Timeline

*The upcoming program dates are subject to change, and will be finalized and updated here by March 1, 2022

March 1, 2022 | Application period opens
April 4, 2022 | Application period closes

April 25, 2022 | Selected applicants notified
May 9, 2022| Summer projects begin
August 15, 2022 | Summer project presentations

September 12, 2022| Fall projects begin
December 2, 2022 | Fall project presentations

更多详情查看微软官网

https://www.microsoft.com/en-us/research/academic-program/rl-open-source-fest/

登录查看更多

0

相关内容

强化学习

强化学习（RL）是机器学习的一个领域，与软件代理应如何在环境中采取行动以最大化累积奖励的概念有关。除了监督学习和非监督学习外，强化学习是三种基本的机器学习范式之一。强化学习与监督学习的不同之处在于，不需要呈现带标签的输入/输出对，也不需要显式纠正次优动作。相反，重点是在探索（未知领域）和利用（当前知识）之间找到平衡。该环境通常以马尔可夫决策过程（MDP）的形式陈述，因为针对这种情况的许多强化学习算法都使用动态编程技术。经典动态规划方法和强化学习算法之间的主要区别在于，后者不假设MDP的确切数学模型，并且针对无法采用精确方法的大型MDP。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

【微软】强化学习系统，37页ppt

专知会员服务

40+阅读 · 2021年6月29日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

海外内推 | 新加坡科技研究局 (A*STAR) 高性能计算研究院招聘AI医疗方向研究员

海外内推 | 新加坡科技研究局 (A*STAR) 高性能计算研究院招聘AI医疗方向研究员

PaperWeekly

1+阅读 · 2022年3月31日

微软招聘祝大家春节快乐！

微软招聘祝大家春节快乐！

微软招聘

0+阅读 · 2022年1月31日

微软校招 | 2022暑期实习招聘正式启动！

微软校招 | 2022暑期实习招聘正式启动！

微软招聘

0+阅读 · 2022年1月10日

微软办公环境大揭秘！

微软办公环境大揭秘！

微软招聘

0+阅读 · 2021年12月24日

星跃计划 | MSR Asia-MSR Redmond 联合科研计划开放，人才持续招募中！

星跃计划 | MSR Asia-MSR Redmond 联合科研计划开放，人才持续招募中！

微软研究院AI头条

0+阅读 · 2021年11月16日

博后招募 | 新加坡国立大学WING实验室招募自然语言处理方向博士后

博后招募 | 新加坡国立大学WING实验室招募自然语言处理方向博士后

PaperWeekly

0+阅读 · 2021年10月13日

微软2022校招正式启动！

微软2022校招正式启动！

微软招聘

0+阅读 · 2021年8月16日

田厂秋招正式开启还剩1天！

田厂秋招正式开启还剩1天！

微软招聘

0+阅读 · 2021年8月15日

谷歌足球游戏环境使用介绍

谷歌足球游戏环境使用介绍

CreateAMind

33+阅读 · 2019年6月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

人为热释放全球气候效应的数值模拟及评估

国家自然科学基金

2+阅读 · 2015年12月31日

基于神经网络和群体智能的稀疏表示算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于神经网络和强化学习的车辆装配系统中的多载量小车实时调度方法

国家自然科学基金

4+阅读 · 2014年12月31日

全球化背景下中国产业转型升级的机制与政策研究

国家自然科学基金

3+阅读 · 2013年12月31日

深度学习理论及在图像识别中的应用研究

国家自然科学基金

6+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

脑意图受限映射下的四足机器人脑机行为交互机理与协作控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

求解多目标旅行商问题的分布估计算法研究

国家自然科学基金

1+阅读 · 2010年12月31日

南海深海过程演变学术交流活动

国家自然科学基金

0+阅读 · 2010年12月31日

进化规划算法的计算时间难题研究

国家自然科学基金

0+阅读 · 2010年12月31日

Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

Arxiv

0+阅读 · 2022年4月16日

Neural Re-ranking in Multi-stage Recommender Systems: A Review

Arxiv

0+阅读 · 2022年4月16日

Investigating the Impact of Forgetting in Software Development

Arxiv

0+阅读 · 2022年4月15日

Attention Mechanisms in Computer Vision: A Survey

Arxiv

58+阅读 · 2021年11月15日

A Survey on Reinforcement Learning for Recommender Systems

Arxiv

22+阅读 · 2021年9月22日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

Dynamic Graph Neural Networks

Arxiv

24+阅读 · 2018年10月24日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

VIP会员

相关主题

Microsoft Research

Machine Learning

相关VIP内容

【微软】强化学习系统，37页ppt

专知会员服务

40+阅读 · 2021年6月29日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《战略分析：面向国防与国际安全的建模与仿真》

《俄乌战争中影响力行动的社交媒体分析》2025最新69页

什么是模块化开放系统方法（MOSA）？从美陆军新型倾转旋翼机视角解读

《用于评估军事作战场景的仿真环境》

相关资讯

海外内推 | 新加坡科技研究局 (A*STAR) 高性能计算研究院招聘AI医疗方向研究员

海外内推 | 新加坡科技研究局 (A*STAR) 高性能计算研究院招聘AI医疗方向研究员

PaperWeekly

1+阅读 · 2022年3月31日

微软招聘祝大家春节快乐！

微软招聘祝大家春节快乐！

微软招聘

0+阅读 · 2022年1月31日

微软校招 | 2022暑期实习招聘正式启动！

微软校招 | 2022暑期实习招聘正式启动！

微软招聘

0+阅读 · 2022年1月10日

微软办公环境大揭秘！

微软办公环境大揭秘！

微软招聘

0+阅读 · 2021年12月24日

星跃计划 | MSR Asia-MSR Redmond 联合科研计划开放，人才持续招募中！

星跃计划 | MSR Asia-MSR Redmond 联合科研计划开放，人才持续招募中！

微软研究院AI头条

0+阅读 · 2021年11月16日

博后招募 | 新加坡国立大学WING实验室招募自然语言处理方向博士后

博后招募 | 新加坡国立大学WING实验室招募自然语言处理方向博士后

PaperWeekly

0+阅读 · 2021年10月13日

微软2022校招正式启动！

微软2022校招正式启动！

微软招聘

0+阅读 · 2021年8月16日

田厂秋招正式开启还剩1天！

田厂秋招正式开启还剩1天！

微软招聘

0+阅读 · 2021年8月15日

谷歌足球游戏环境使用介绍

谷歌足球游戏环境使用介绍

CreateAMind

33+阅读 · 2019年6月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关基金

人为热释放全球气候效应的数值模拟及评估

国家自然科学基金

2+阅读 · 2015年12月31日

基于神经网络和群体智能的稀疏表示算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于神经网络和强化学习的车辆装配系统中的多载量小车实时调度方法

国家自然科学基金

4+阅读 · 2014年12月31日

全球化背景下中国产业转型升级的机制与政策研究

国家自然科学基金

3+阅读 · 2013年12月31日

深度学习理论及在图像识别中的应用研究

国家自然科学基金

6+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

脑意图受限映射下的四足机器人脑机行为交互机理与协作控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

求解多目标旅行商问题的分布估计算法研究

国家自然科学基金

1+阅读 · 2010年12月31日

南海深海过程演变学术交流活动

国家自然科学基金

0+阅读 · 2010年12月31日

进化规划算法的计算时间难题研究

国家自然科学基金

0+阅读 · 2010年12月31日

相关论文

Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

Arxiv

0+阅读 · 2022年4月16日

Neural Re-ranking in Multi-stage Recommender Systems: A Review

Arxiv

0+阅读 · 2022年4月16日

Investigating the Impact of Forgetting in Software Development

Arxiv

0+阅读 · 2022年4月15日

Attention Mechanisms in Computer Vision: A Survey

Arxiv

58+阅读 · 2021年11月15日

A Survey on Reinforcement Learning for Recommender Systems

Arxiv

22+阅读 · 2021年9月22日

Domain Generalization in Vision: A Survey

Arxiv

16+阅读 · 2021年7月18日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

Dynamic Graph Neural Networks

Arxiv

24+阅读 · 2018年10月24日

Deep Reinforcement Learning for List-wise Recommendations

Arxiv

13+阅读 · 2018年1月5日

Multimodal Machine Learning: A Survey and Taxonomy

Arxiv

151+阅读 · 2017年8月1日

大家都在搜

NTU博士论文

国防科技创新

精排模型-从MLP到行为序列：DIN、DIEN、MIMN、SIM、DSIN

微信扫码咨询专知VIP会员