IGN: 隐性生成网络 (IGN : Implicit Generative Networks) - 专知论文

会员服务 ·

0

生成器网络 · Atari · Performer · Networking · 判别函数 ·

2022 年 9 月 22 日

IGN : Implicit Generative Networks

翻译：IGN: 隐性生成网络

Haozheng Luo,Tianyi Wu,Feiyu Han,Zhijun Yan,Jianfen Zhang

In this work, we build recent advances in distributional reinforcement learning to give a state-of-art distributional variant of the model based on the IQN. We achieve this by using the GAN model's generator and discriminator function with the quantile regression to approximate the full quantile value for the state-action return distribution. We demonstrate improved performance on our baseline dataset - 57 Atari 2600 games in the ALE. Also, we use our algorithm to show the state-of-art training performance of risk-sensitive policies in Atari games with the policy optimization and evaluation.

翻译：在这项工作中,我们建设了最近在分配强化学习方面的进步,为基于IQN的模型提供了一个最先进的分配变体。我们通过使用GAN模型的生成器和带有四分位回归作用的区别函数来达到这一点,以接近国家行动回报分布的四分位值。我们展示了我们基线数据集的改进性能 - 57 Atari 2600游戏在ALE中的功能。此外,我们利用我们的算法来展示阿塔里游戏中风险敏感政策的最新培训性能,同时进行政策优化和评估。

0

相关内容

生成器网络

生成器网络

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

应用数学暑期学校（2015）

国家自然科学基金

5+阅读 · 2015年7月12日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

晶面调控砷化镓纳米线的原位掺杂与输运特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向异构并行系统的生物序列比对并行策略及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Diffusion-based Generative Speech Source Separation

Diffusion-based Generative Speech Source Separation

Arxiv

0+阅读 · 2022年10月31日

DORE: Document Ordered Relation Extraction based on Generative Framework

Arxiv

0+阅读 · 2022年10月28日

Differentially Private Generative Adversarial Networks with Model Inversion

Arxiv

0+阅读 · 2022年10月27日

ImGAGN:Imbalanced Network Embedding via Generative Adversarial Graph Networks

Arxiv

14+阅读 · 2021年6月5日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

VIP会员

文章信息

相关主题

生成器网络

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

相关论文

Diffusion-based Generative Speech Source Separation

Diffusion-based Generative Speech Source Separation

Arxiv

0+阅读 · 2022年10月31日

DORE: Document Ordered Relation Extraction based on Generative Framework

Arxiv

0+阅读 · 2022年10月28日

Differentially Private Generative Adversarial Networks with Model Inversion

Arxiv

0+阅读 · 2022年10月27日

ImGAGN:Imbalanced Network Embedding via Generative Adversarial Graph Networks

Arxiv

14+阅读 · 2021年6月5日

Generative Adversarial Autoencoder Networks

Arxiv

11+阅读 · 2018年3月23日

相关基金

应用数学暑期学校（2015）

国家自然科学基金

5+阅读 · 2015年7月12日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

晶面调控砷化镓纳米线的原位掺杂与输运特性研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向异构并行系统的生物序列比对并行策略及算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员