BlobGAN: Spatially Disentangled Scene Representations - Zhuanzhi (专知) Papers

May 5, 2022

BlobGAN: Spatially Disentangled Scene Representations

Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros

From arXiv. Project webpage: http://www.dave.ml/blobgan

We propose an unsupervised, mid-level representation for a generative model of scenes. The representation is mid-level in that it is neither per-pixel nor per-image; rather, scenes are modeled as a collection of spatial, depth-ordered "blobs" of features. Blobs are differentiably placed onto a feature grid that is decoded into an image by a generative adversarial network. Due to the spatial uniformity of blobs and the locality inherent to convolution, our network learns to associate different blobs with different entities in a scene and to arrange these blobs to capture scene layout. We demonstrate this emergent behavior by showing that, despite training without any supervision, our method enables applications such as easy manipulation of objects within a scene (e.g., moving, removing, and restyling furniture), creation of feasible scenes given constraints (e.g., plausible rooms with drawers at a particular location), and parsing of real-world images into constituent parts. On a challenging multi-category dataset of indoor scenes, BlobGAN outperforms StyleGAN2 in image quality as measured by FID. See our project page for video results and interactive demo: http://www.dave.ml/blobgan
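To make the idea of placing depth-ordered blobs onto a feature grid more concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the isotropic soft-disk opacity, the `sharpness` parameter, the back-to-front alpha compositing rule, and every name below (`splat_blobs`, `centers`, `scales`, `features`, `background`) are assumptions chosen for illustration. The abstract only states that blobs are spatial, depth-ordered, and differentiably placed on a grid that a GAN then decodes.

```python
import numpy as np


def splat_blobs(centers, scales, features, background, grid_size=16, sharpness=20.0):
    """Alpha-composite soft, isotropic blobs onto a feature grid.

    Illustrative assumptions (not taken from the paper):
      centers    (K, 2) blob centers as (x, y) in [0, 1] x [0, 1]
      scales     (K,)   blob radii in the same normalized units
      features   (K, D) one feature vector per blob
      background (D,)   feature vector for uncovered cells
    Blobs are assumed sorted by depth, farthest first, so later blobs
    are drawn over earlier ones.
    """
    centers = np.asarray(centers, dtype=float)
    scales = np.asarray(scales, dtype=float)
    features = np.asarray(features, dtype=float)
    background = np.asarray(background, dtype=float)

    # Cell-center coordinates of the grid in normalized units.
    coords = (np.arange(grid_size) + 0.5) / grid_size
    xs, ys = np.meshgrid(coords, coords)  # each has shape (H, W)

    # Start from a grid filled with the background feature.
    grid = np.broadcast_to(
        background, (grid_size, grid_size, background.shape[0])
    ).copy()

    for (cx, cy), radius, feat in zip(centers, scales, features):
        dist = np.sqrt((xs - cx) ** 2 + (ys - cy) ** 2)
        # Smooth opacity: close to 1 inside the radius, close to 0 outside.
        # Every operation here is differentiable in cx, cy, and radius.
        alpha = 1.0 / (1.0 + np.exp(-sharpness * (radius - dist)))
        alpha = alpha[..., None]
        # Standard "over" compositing of this blob onto what is behind it.
        grid = (1.0 - alpha) * grid + alpha * feat

    return grid


# Two overlapping blobs: a large far blob and a smaller near blob.
grid = splat_blobs(
    centers=[[0.5, 0.5], [0.5, 0.5]],
    scales=[0.3, 0.2],
    features=[[1.0, 0.0], [0.0, 1.0]],
    background=[0.0, 0.0],
)
```

In this toy setup the nearer blob's feature dominates at the center of the grid, while the corners stay close to the background feature. Moving or removing an object then corresponds to editing a row of `centers` or dropping a blob before compositing. In the real model a learned generator would decode such a grid into an image.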
