深度强化学习实验室报道
来源:ICML2020
作者: RchalYang
ICML 2020放榜了。入选论文创新高,共有1088篇论文突出重围。然而,接收率却是一年比一年低,这次仅为21.8%(去年为22.6%,前年为24.9%)。从整个榜单上看,谷歌仍为最强实力机构,共有138篇收录(数据包含谷歌大脑、DeepMind)。加州大学伯克利分校:88篇,斯坦福:75篇, MIT:66篇,微软:53篇,Facebook:32篇,IBM:19篇,其中国内机构也表现不俗。尤其是一直以来作为主力的大学们。清华:36篇,北大:20篇,上交:16篇,其中强化学习占有率达到了:11.58%, 下面是强化学习领域论文
Ilai Bistritz (Stanford University),Tavor Z Baharav (Stanford University),Amir Leshem (Bar-Ilan University),Nicholas Bambos
Ayush Jain (University of Southern California) · Andrew Szot (University of Southern California) · Joseph Lim (Univ. of Southern California)
Sankalp Garg (Indian Institute of Technology Delhi) · Aniket Bajpai (Indian Institute of Technology, Delhi) · Mausam (IIT Delhi)
Jiawei Huang (University of Illinois at Urbana-Champaign) · Nan Jiang (University of Illinois at Urbana-Champaign)
Abhimanyu Dubey (Massachusetts Institute of Technology) · Alex `Sandy' Pentland (MIT)
Abhimanyu Dubey (Massachusetts Institute of Technology) · Alex `Sandy' Pentland (MIT)
Hanrui Zhang (Duke University) · Vincent Conitzer (Duke)
Aadirupa Saha (Indian Institute of Science (IISc), Bangalore) · Pierre Gaillard () · Michal Valko (DeepMind)
Yaodong Yang (Huawei Technology R&D UK) · Ying Wen (UCL) · Jun Wang (UCL) · Liheng Chen (Shanghai Jiao Tong University) · Kun Shao (Huawei Noah's Ark Lab) · David Mguni (Noah's Ark Laboratory, Huawei) · Weinan Zhang (Shanghai Jiao Tong University)
Masatoshi Uehara (Harvard University) · Jiawei Huang (University of Illinois at Urbana-Champaign) · Nan Jiang (University of Illinois at Urbana-Champaign)
Rundong Wang (Nanyang Technological University) · Xu He (Nanyang Technological University) · Runsheng Yu (Nanyang Technological University) · Wei Qiu (Nanyang Technological University) · Bo An (Nanyang Technological University) · Zinovi Rabinovich (Nanyang Technological University)
Kefan Dong (Tsinghua University) · Yingkai Li (Northwestern University) · Qin Zhang (Indiana University Bloomington) · Yuan Zhou (UIUC)
Xinyi Wang (Carnegie Mellon University) · Hieu Pham (Carnegie Mellon University) · Paul Michel (Carnegie Mellon University) · Antonios Anastasopoulos (Carnegie Mellon University) · Jaime Carbonell (Carnegie Mellon University) · Graham Neubig (Carnegie Mellon University)
Lior Shani (Technion) · Yonathan Efroni (Technion) · Aviv Rosenberg (Tel Aviv University) · Shie Mannor (Technion)
Chi Jin (Princeton University) · Tiancheng Jin (University of Southern California) · Haipeng Luo (University of Southern California) · Suvrit Sra (MIT) · Tiancheng Yu (MIT)
James Kostas (University of Massachusetts Amherst) · Chris Nota (University of Massachusetts Amherst) · Philip Thomas (University of Massachusetts Amherst)
Yao Liu (Stanford University) · Pierre-Luc Bacon (Stanford University) · Emma Brunskill (Stanford University)
Yunhao Tang (Columbia University) · Shipra Agrawal (Columbia University) · Yuri Faenza (Columbia University)
Akifumi Wachi (IBM Research AI) · Yanan Sui (Tsinghua University)
Tonghan Wang (Tsinghua University) · Heng Dong (Tsinghua) · Victor Lesser (UMASS) · Chongjie Zhang (Tsinghua University)
Max Simchowitz (UC Berkeley) · Dylan Foster (MIT)
Neale Ratzlaff (Oregon State University) · Qinxun Bai (Horizon Robotics) · Fuxin Li (Oregon State University) · Wei Xu (Horizon Robotics)
Jie Xu (Massachusetts Institute of Technology) · Yunsheng Tian (Massachusetts Institute of Technology) · Pingchuan Ma (MIT) · Daniela Rus (MIT CSAIL) · Shinjiro Sueda (Texas A&M University) · Wojciech Matusik (MIT)
Nathan Kallus (Cornell University) · Masatoshi Uehara (Harvard University)
Nathan Kallus (Cornell University) · Masatoshi Uehara (Harvard University)
Simon Schmitt (DeepMind) · Matteo Hessel (Deep Mind) · Karen Simonyan (DeepMind)
Amin Rakhsha (MPI-SWS) · Goran Radanovic (Max Planck Institute for Software Systems) · Rati Devidze (Max Planck Institute for Software Systems) · Jerry Zhu (University of Wisconsin-Madison) · Adish Singla (Max Planck Institute (MPI-SWS))
Chengchun Shi (London School of Economics and Political Science) · Runzhe Wan (North Carolina State University) · Rui Song () · Wenbin Lu () · Ling Leng (Amazon)
Jean Tarbouriech (Facebook AI Research Paris & Inria Lille) · Evrard Garcelon (Facebook AI Research ) · Michal Valko (DeepMind) · Matteo Pirotta (Facebook AI Research) · Alessandro Lazaric (Facebook AI Research)
Alexander Vezhnevets (DeepMind) · Yuhuai Wu (University of Toronto) · Maria Eckstein (UC Berkeley) · Rémi Leblond (DeepMind) · Joel Z Leibo (DeepMind)
Gregor Simm (Cambridge University) · Robert Pinsler (University of Cambridge) · Jose Hernandez-Lobato (University of Cambridge)
DiJia Su (Princeton University) · Jayden Ooi (Google) · Tyler Lu (Google) · Dale Schuurmans (Google / University of Alberta) · Craig Boutilier (Google)
Qi Cai (Northwestern University) · Zhuoran Yang (Princeton University) · Chi Jin (Princeton University) · Zhaoran Wang (Northwestern U)
Che Wang (New York University) · Yanqiu Wu (New York University) · Quan Vuong (University of California San Diego) · Keith Ross (New York University Shanghai)
Youzhi Zhang (Nanyang Technological University) · Bo An (Nanyang Technological University)
Victor Campos (Barcelona Supercomputing Center) · Alexander Trott (Salesforce Research) · Caiming Xiong (Salesforce) · Richard Socher (Salesforce) · Xavier Giro-i-Nieto (Universitat Politecnica de Catalunya) · Jordi Torres (Barcelona Supercomputing Center)
Brian Zhang (Carnegie Mellon University) · Tuomas Sandholm (Carnegie Mellon University)
Samy Jelassi (Princeton University) · Carles Domingo-Enrich (NYU) · Damien Scieur (Samsung Advanced Institute of Technology AI Lab Montreal (SAIL)) · Arthur Mensch (ENS) · Joan Bruna (New York University)
Evgeny Kharitonov (FAIR) · Rahma Chaabouni (Facebook/ENS/INRIA) · Diane Bouchacourt (Facebook AI) · Marco Baroni (Facebook Artificial Intelligence Research)
Ron Amit (Technion – Israel Institute of Technology) · Kamil Ciosek (Microsoft) · Ron Meir (Technion Israeli Institute of Technology)
Kuno Kim (Stanford University) · Yihong Gu (Tsinghua University) · Jiaming Song (Stanford) · Shengjia Zhao (Stanford University) · Stefano Ermon (Stanford University)
Evan Liu (Google) · Milad Hashemi (Google) · Kevin Swersky (Google Brain) · Parthasarathy Ranganathan (Google, USA) · Junwhan Ahn (Google)
Lingxiao Wang (Northwestern University) · Zhuoran Yang (Princeton University) · Zhaoran Wang (Northwestern U)
Quinlan Sykora (Uber ATG) · Mengye Ren (Uber ATG / University of Toronto) · Raquel Urtasun (Uber ATG)
Pan Xu (University of California, Los Angeles) · Quanquan Gu (University of California, Los Angeles)
Johannes Fischer (Karlsruhe Institute of Technology (KIT)) · Ömer Sahin Tas (Karlsruhe Institute of Technology (KIT))
Dylan Foster (MIT) · Alexander Rakhlin (MIT)
Xi Liu (Texas A&M University) · Ping-Chun Hsieh (National Chiao Tung University) · Yu Heng Hung (NCTU) · Anirban Bhattacharya (Texas A&M University) · P. Kumar (Texas A&M University)
Yi Su (Cornell University) · Pavithra Srinath (Microsoft Research) · Akshay Krishnamurthy (Microsoft Research)
Claire Vernade (DeepMind) · Alexandra Carpentier (Otto-von-Guericke University) · Tor Lattimore (DeepMind) · Giovanni Zappella (Amazon) · Beyza Ermis (Amazon Research) · Michael Brueckner (Amazon Research Berlin)
Feihu Huang (University of Pittsburgh) · Shangqian Gao (University of Pittsburgh) · Jian Pei (Simon Fraser University) · Heng Huang (University of Pittsburgh)
Alberto Maria Metelli (Politecnico di Milano) · Flavio Mazzolini (Politecnico di Milano) · Lorenzo Bisi (Politecnico di Milano) · Luca Sabbioni (Politecnico di Milano) · Marcello Restelli (Politecnico di Milano)
Zeyu Zheng (University of Michigan) · Junhyuk Oh (DeepMind) · Matteo Hessel (Deep Mind) · Zhongwen Xu (DeepMind) · Manuel Kroiss (DeepMind) · Hado van Hasselt (DeepMind) · David Silver (Google DeepMind) · Satinder Singh (DeepMind)
Giuseppe Vietri (University of Minnesota) · Borja de Balle Pigem (Amazon Research) · Steven Wu (University of Minnesota) · Akshay Krishnamurthy (Microsoft Research)
Louis Faury (Criteo) · Marc Abeille (Criteo) · Clement Calauzenes (Criteo) · Olivier Fercoq (Telecom Paris)
Gregory Farquhar (University of Oxford) · Laura Gustafson (Facebook AI Research) · Zeming Lin (Facebook AI Reseach) · Shimon Whiteson (Oxford University) · Nicolas Usunier (Facebook AI Research) · Gabriel Synnaeve (Facebook AI Research)
Adam Stooke (UC Berkeley) · Joshua Achiam (OpenAI) · Pieter Abbeel (UC Berkeley & Covariant)
Emilio Parisotto (Carnegie Mellon University) · Francis Song (DeepMind) · Jack Rae (DeepMind) · Razvan Pascanu (DeepMind) · Caglar Gulcehre (DeepMind) · Siddhant Jayakumar (DeepMind) · Max Jaderberg (DeepMind) · Raphael Lopez Kaufman (Deepmind) · Aidan Clark (DeepMind) · Seb Noury (DeepMind) · Matthew Botvinick (DeepMind) · Nicolas Heess (DeepMind) · Raia Hadsell (DeepMind)
Aldo Pacchiano (UC Berkeley) · Jack Parker-Holder (University of Oxford) · Yunhao Tang (Columbia University) · Krzysztof Choromanski (Google) · Anna Choromanska (NYU Tandon School of Engineering) · Michael Jordan (UC Berkeley)
Dongruo Zhou (UCLA) · Lihong Li (Google Research) · Quanquan Gu (University of California, Los Angeles)
Nian Si (Stanford University) · Fan Zhang (Stanford University) · Zhengyuan Zhou (Stanford University) · Jose Blanchet (Stanford University)
Andrew Bennett (Cornell University) · Nathan Kallus (Cornell University)
Tanmay Shankar (Facebook AI Research) · Abhinav Gupta (Carnegie Mellon University)
Karl Cobbe (OpenAI) · Chris Hesse (OpenAI) · Jacob Hilton (OpenAI) · John Schulman (OpenAI)
Khimya Khetarpal (McGill University, Mila Montreal) · Zafarali Ahmed (DeepMind) · Gheorghe Comanici (DeepMind) · David Abel (Brown University) · Doina Precup (DeepMind)
Jinsung Yoon (University of California, Los Angeles) · Sercan O. Arik (Google) · Tomas Pfister (Google)
Chi Jin (Princeton University) · Akshay Krishnamurthy (Microsoft Research) · Max Simchowitz (UC Berkeley) · Tiancheng Yu (MIT )
Junzhe Zhang (Columbia University)
Ibrahim El Shar (University of Pittsburgh) · Daniel Jiang (University of Pittsburgh)
Scott Jordan (University of Massachusetts Amherst) · Yash Chandak (University of Massachusetts Amherst) · Daniel Cohen (University of Massachusetts Amherst) · Mengxue Zhang (umass Amherst ) · Philip Thomas (University of Massachusetts Amherst)
Yu Bai (Salesforce Research) · Chi Jin (Princeton University)
Aravind Rajeswaran (University of Washington) · Igor Mordatch (OpenAI) · Vikash Kumar (Google)
Yash Chandak (University of Massachusetts Amherst) · Georgios Theocharous (Adobe Research) · Shiv Shankar (University of Massachusetts) · Martha White (University of Alberta) · Sridhar Mahadevan (Adobe Research) · Philip Thomas (University of Massachusetts Amherst)
Tung-Che Liang (Duke University) · Zhanwei Zhong (Duke University) · Yaas Bigdeli (Duke Univsersity) · Tsung-Yi Ho (National Tsing Hua University) · Richard Fair (Duke University) · Krishnendu Chakrabarty (Duke University)
Aleksei Petrenko (University of Southern California) · Zhehui Huang (University of Southern California) · Tushar Kumar (University of Southern California) · Gaurav Sukhatme (University of Southern California) · Vladlen Koltun (Intel Labs)
Yaodong Yang (Tianjin University) · Jianye Hao (Tianjin University) · Guangyong Chen (Tencent) · Hongyao Tang (Tianjin University) · Yingfeng Chen (NetEase Fuxi AI Lab) · Yujing Hu (NetEase Fuxi AI Lab) · Changjie Fan (Netease) · Zhongyu Wei (Fudan University)
Tianyi Lin (UC Berkeley) · Zhengyuan Zhou (Stanford University) · Panayotis Mertikopoulos (CNRS) · Michael Jordan (UC Berkeley)
Feng Zhu (Peking University) · Zeyu Zheng (UC Berkeley)
Kimin Lee (UC Berkeley) · Younggyo Seo (KAIST) · Seunghyun Lee (KAIST) · Honglak Lee (Google / U. Michigan) · Jinwoo Shin (KAIST)
Youngsuk Park (Stanford University) · Ryan Rossi (Adobe Research) · Zheng Wen (DeepMind) · Gang Wu (Adobe Research) · Handong Zhao (Adobe Research)
Jean-Bastien Grill (DeepMind) · Florent Altché (DeepMind) · Yunhao Tang (Columbia University) · Thomas Hubert (DeepMind) · Michal Valko (DeepMind) · Ioannis Antonoglou (Deepmind) · Remi Munos (DeepMind)
Kefan Dong (Tsinghua University) · Yuping Luo (Princeton University) · Tianhe Yu (Stanford University) · Chelsea Finn (Stanford) · Tengyu Ma (Stanford)
Xingrui Yu (University of Technology Sydney) · Yueming LYU (University of Technology Sydney) · Ivor Tsang (University of Technology Sydney)
Kei Ota (Mitsubishi Electric Corporation) · Tomoaki Oiki (Mitsubishi Electric) · Devesh Jha (Mitsubishi Electric Research Labs) · Toshisada Mariyama (Mitsubishi Electric) · Daniel Nikovski (Mitsubishi Electric Research Labs)
Byung-Jun Lee (KAIST) · Jongmin Lee (KAIST) · Peter Vrancx (PROWLER.io) · Dongho Kim (Prowler.io) · Kee-Eung Kim (KAIST)
Tom Jurgenson (Technion) · Or Avner (Technion) · Edward Groshev (Osaro, Inc.) · Aviv Tamar (Technion)
Adrià Puigdomenech Badia (Deepmind) · Bilal Piot (DeepMind) · Steven Kapturowski (Deepmind) · Pablo Sprechmann (Google DeepMind) · Oleksandr Vitvitskyi (DeepMind) · Zhaohan Guo (DeepMind) · Charles Blundell (DeepMind)
John Martin (Stevens Institute of Technology) · Michal Lyskawinski (Stevens Institute of Technology) · Xiaohu Li (Stevens Institute of Technology) · Brendan Englot (Stevens Institute of Technology)
Amélie Héliou (Criteo) · Panayotis Mertikopoulos (CNRS) · Zhengyuan Zhou (Stanford University)
Roberta Raileanu (NYU) · Max Goldstein (NYU) · Arthur Szlam (Facebook) · Facebook Rob Fergus (Facebook AI Research, NYU)
Salman Sadiq Shuvo (University of South Florida) · Yasin Yilmaz (University of South Florida) · Alan Bush (University of South Florida) · Mark Hafen (University of South Florida)
Remi Munos (DeepMind) · Julien Perolat (DeepMind) · Jean-Baptiste Lespiau (DeepMind) · Mark Rowland (DeepMind) · Bart De Vylder (DeepMind) · Marc Lanctot (DeepMind) · Finbarr Timbers (DeepMind) · Daniel Hennes (DeepMind) · Shayegan Omidshafiei (DeepMind) · Audrunas Gruslys (DeepMind) · Mohammad Gheshlaghi Azar (Deepmind) · Edward Lockhart (DeepMind) · Karl Tuyls (DeepMind)
Daniel Jarrett (University of Cambridge) · Mihaela van der Schaar (University of Cambridge)
Hippolyte Bourel (ENS Rennes) · Odalric-Ambrym Maillard (Inria Lille - Nord Europe) · Mohammad Sadegh Talebi (University of Copenhagen)
Zhaohan Guo (DeepMind) · Bernardo Avila Pires (DeepMind) · Mohammad Gheshlaghi Azar (Deepmind) · Bilal Piot (DeepMind) · Florent Altché (DeepMind) · Jean-Bastien Grill (DeepMind) · Remi Munos (DeepMind)
Clare Lyle (University of Oxford) · Amy Zhang (McGill University) · Angelos Filos (University of Oxford) · Shagun Sodhani (Facebook AI Research) · Marta Kwiatkowska (Oxford University) · Yarin Gal (University of Oxford) · Doina Precup (McGill University / DeepMind) · Joelle Pineau (McGill University / Facebook)
Qianli Shen (Peking University) · Yan Li (Georgia Tech) · Haoming Jiang (Georgia Tech) · Zhaoran Wang (Northwestern) · Tuo Zhao (Gatech)
Chen-Yu Wei (University of Southern California) · Mehdi Jafarnia (University of Southern California) · Haipeng Luo (University of Southern California) · Hiteshi Sharma (University of Southern California) · Rahul Jain (USC)
Dipendra Misra (Microsoft) · Mikael Henaff (Microsoft) · Akshay Krishnamurthy (Microsoft Research) · John Langford (Microsoft Research)
Yaqi Duan (Princeton University) · Zeyu Jia (Peking University) · Mengdi Wang (Princeton University)
Rui Wang (Uber AI) · Joel Lehman () · Aditya Rawal (Uber AI Labs) · Jiale Zhi (Uber AI) · Yulun Li (Uber AI) · Jeffrey Clune (Open AI) · Kenneth Stanley (Uber AI and University of Central Florida)
Xuezhou Zhang (UW-Madison) · Yuzhe Ma (Univ. of Wisconsin-Madison) · Adish Singla (Max Planck Institute (MPI-SWS)) · Jerry Zhu (University of Wisconsin-Madison)
Maggie Makar (MIT) · Fredrik Johansson (Chalmers University of Technology) · John Guttag (MIT) · David Sontag (Massachusetts Institute of Technology)
Yuda Song (University of California, San Diego) · Aditi Mavalankar (University of California San Diego) · Wen Sun (Microsoft Research) · Sicun Gao (University of California, San Diego)
Gabriele Farina (Carnegie Mellon University) · Christian Kroer (Columbia University) · Tuomas Sandholm (Carnegie Mellon University)
Silviu Pitis (University of Toronto) · Harris Chan (University of Toronto, Vector Institute) · Stephen Zhao (University of Toronto) · Bradly Stadie (Vector Institute) · Jimmy Ba (University of Toronto)
Jesse Zhang (UC Berkeley) · Brian Cheung (UC Berkeley) · Chelsea Finn (Stanford) · Sergey Levine (UC Berkeley) · Dinesh Jayaraman (University of Pennsylvania)
Rishabh Agarwal (Google Research, Brain Team) · Dale Schuurmans (Google / University of Alberta) · Mohammad Norouzi (Google Brain)
Gellért Weisz (DeepMind) · Tor Lattimore (DeepMind) · Csaba Szepesvari (DeepMind/University of Alberta)
Dibya Ghosh (Google) · Marc Bellemare (Google Brain)
Yihao Feng (The University of Texas at Austin) · Tongzheng Ren (UT Austin) · Ziyang Tang (University of Texas at Austin) · Qiang Liu (UT Austin)
Manan Tomar (Indian Institute of Technology, Madras) · Yonathan Efroni (Technion) · Mohammad Ghavamzadeh (Facebook AI Research)
Jincheng Mei (Google / University of Alberta) · Chenjun Xiao (Google / University of Alberta) · Csaba Szepesvari (DeepMind/University of Alberta) · Dale Schuurmans (University of Alberta)
Ashley Edwards (Uber AI) · Himanshu Sahni (Georgia Institute of Technology) · Rosanne Liu (Deep Collective) · Jane Hung (Uber) · Ankit Jain (Uber AI Labs) · Rui Wang (Uber AI) · Adrien Ecoffet (Uber AI) · Thomas Miconi (Uber AI Labs) · Charles Isbell (Georgia Institute of Technology) · Jason Yosinski (Uber Labs)
Omer Gottesman (Harvard University) · Joseph Futoma (Harvard University) · Yao Liu (Stanford University) · Sonali Parbhoo (Harvard University) · Leo Celi (MIT) · Emma Brunskill (Stanford University) · Finale Doshi-Velez (Harvard University)
Michael Laskin (UC Berkeley) · Pieter Abbeel (UC Berkeley & Covariant) · Aravind Srinivas (UC Berkeley)
Mark Chen (OpenAI) · Alec Radford (OpenAI) · Rewon Child (OpenAI) · Jeffrey K Wu (OpenAI) · Heewoo Jun (OpenAI) · David Luan (OpenAI) · Ilya Sutskever (OpenAI)
Zhongxiang Dai (National University of Singapore) · Yizhou Chen (National University of Singapore) · Bryan Kian Hsiang Low (National University of Singapore) · Patrick Jaillet (MIT) · Teck-Hua Ho (National University of Singapore)
William Fedus (University of Montreal/Google Brain) · Prajit Ramachandran (Google) · Rishabh Agarwal (Google Research, Brain Team) · Yoshua Bengio (Mila / U. Montreal) · Hugo Larochelle (Google Brain) · Mark Rowland (DeepMind) · Will Dabney (DeepMind)
Adam Elmachtoub (Columbia University) · Jason Cheuk Nam Liang (MIT) · Ryan McNellis (Amazon)
Sai Krishna Gottipati (99andBeyond) · Boris Sattarov (99andBeyond) · Sufeng Niu (Linkedin) · Haoran Wei (University of Delaware) · Yashaswi Pathak (International Institute of Information Technology,Hyderabad) · Shengchao Liu (MILA-UdeM) · Shengchao Liu (Mila, Université de Montréal) · Simon Blackburn (Mila) · Karam Thomas (99andBeyond) · Connor Coley (MIT) · Jian Tang (HEC Montreal & MILA) · Sarath Chandar (Mila / École Polytechnique de Montréal) · Yoshua Bengio (Mila / U. Montreal)
Aidan Curtis (Rice University) · Minjian Xin (Shanghai Jiao Tong University) · Dilip Arumugam (Stanford University) · Kevin Feigelis (Stanford University) · Daniel Yamins (Stanford University)
Rui Shu (Stanford University) · Tung Nguyen (VinAI Research) · Yinlam Chow (Google) · Tuan Pham (VinAI) · Khoat Than (VinAI & HUST) · Mohammad Ghavamzadeh (Facebook) · Stefano Ermon (Stanford University) · Hung Bui (VinAI Research)
Hang Lai (Shanghai Jiao Tong University) · Jian Shen (Shanghai Jiao Tong University) · Weinan Zhang (Shanghai Jiao Tong University) · Yong Yu (Shanghai Jiao Tong University)
Yujia Jin (Stanford University) · Aaron Sidford (Stanford)
Abbas Abdolmaleki (Google DeepMind) · Sandy Huang (DeepMind) · Leonard Hasenclever (DeepMind) · Michael Neunert (Google DeepMind) · Martina Zambelli (DeepMind) · Murilo Martins (DeepMind) · Francis Song (DeepMind) · Nicolas Heess (DeepMind) · Raia Hadsell (DeepMind) · Martin Riedmiller (DeepMind)
总结3: 《强化学习导论》代码/习题答案大全
总结6: 万字总结 || 强化学习之路
完
第66篇:分布式强化学习框架Acme,并行性加强
第65篇:DQN系列(3): 优先级经验回放(PER)
第64篇:UC Berkeley开源RAD来改进强化学习算法
第61篇:David Sliver 亲自讲解AlphaGo、Zero
第59篇:Agent57在所有经典Atari 游戏中吊打人类
第58篇:清华开源「天授」强化学习平台
第57篇:Google发布"强化学习"框架"SEED RL"
第53篇:TRPO/PPO提出者John Schulman谈科研
第52篇:《强化学习》可复现性和稳健性,如何解决?
第51篇:强化学习和最优控制的《十个关键点》
第50篇:微软全球深度强化学习开源项目开放申请
第49篇:DeepMind发布强化学习库 RLax
第48篇:AlphaStar过程详解笔记
第47篇:Exploration-Exploitation难题解决方法
第45篇:DQN系列(1): Double Q-learning
第44篇:科研界最全工具汇总
第42篇:深度强化学习入门到精通资料综述
第41篇:顶会征稿 || ICAPS2020: DeepRL
第40篇:实习生招聘 || 华为诺亚方舟实验室
第39篇:滴滴实习生|| 深度强化学习方向
第37篇:Call For Papers# IJCNN2020-DeepRL
第36篇:复现"深度强化学习"论文的经验之谈
第35篇:α-Rank算法之DeepMind及Huawei改进
第34篇:从Paper到Coding, DRL挑战34类游戏
第31篇:强化学习,路在何方?
第30篇:强化学习的三种范例
第29篇:框架ES-MAML:进化策略的元学习方法
第28篇:138页“策略优化”PPT--Pieter Abbeel
第27篇:迁移学习在强化学习中的应用及最新进展
第26篇:深入理解Hindsight Experience Replay
第25篇:10项【深度强化学习】赛事汇总
第24篇:DRL实验中到底需要多少个随机种子?
第23篇:142页"ICML会议"强化学习笔记
第22篇:通过深度强化学习实现通用量子控制
第21篇:《深度强化学习》面试题汇总
第20篇:《深度强化学习》招聘汇总(13家企业)
第19篇:解决反馈稀疏问题之HER原理与代码实现
第17篇:AI Paper | 几个实用工具推荐
第16篇:AI领域:如何做优秀研究并写高水平论文?
第14期论文: 2020-02-10(8篇)
第13期论文:2020-1-21(共7篇)
第12期论文:2020-1-10(Pieter Abbeel一篇,共6篇)
第11期论文:2019-12-19(3篇,一篇OpennAI)
第10期论文:2019-12-13(8篇)
第9期论文:2019-12-3(3篇)
第8期论文:2019-11-18(5篇)
第7期论文:2019-11-15(6篇)
第6期论文:2019-11-08(2篇)
第5期论文:2019-11-07(5篇,一篇DeepMind发表)
第4期论文:2019-11-05(4篇)
第3期论文:2019-11-04(6篇)
第2期论文:2019-11-03(3篇)
第1期论文:2019-11-02(5篇)