revelation of MONet - 专知

会员服务 ·

0

revelation of MONet

2019 年 6 月 8 日 CreateAMind

MONet: Unsupervised Scene Decomposition and Representation

1 .总的来说，这边论文讲了如上图的事情，与传统VAE不同的地方是，多了一个注意力网络产生出mask。

2. 注意力网络部分用的U-net网络。

3. VAE decoder部分用的是spatial broadcast decoder。

4.具体细节和MASK的表示形式在论文中有详细说明。

这里探讨一下这个论文的诞生过程：

我看的时候遍历了整篇文章都没有找到一个理论依据：说明整个loss函数的优化方向会往注意力网络输出正确的mask的方向流动。对！完全没有给出数学证明，而且mask和vae decoder输出这两部分都是在变化的，你不确定它们的流动方向。

那么，它是怎么整的呢？

首先，他们（论文作者）只有一个直觉，于是提出这个假设：

, if a networkperforming some task can be repeatedly reused across scene elements with commonstructure (such as objects and other visual entities), its available capacity(limited for example by its architecture and weights) will be more eﬀectivelyutilised and thus will be more eﬃcient than the same network processing theentire scene at once.

也就是说，让一个图片被掩码成多个图片，这些图片有共同的一些结构，再让这些图片通过同一个vae网络，这样重建的结果比单独就一张图片通过vae网络更好。因为他们觉得网络的容量得到更好的利用。

本着这个想法他们开始做实验：

看左上角的图：蓝色不用mask，绿色是别的图片的mask，也就是错误的mask，红色是给出正确的mask。这个实验就证明了：如果你让loss最小化，他会朝着正确mask的方向流动，因为如图这个正确的mask是最小的。

总结：这个论文的实验思路是很棒的，在你给不出严密数学证明的时候。

登录查看更多

5

相关内容

MONET

MONET:Mobile Networks & Applications。 Explanation：移动网络与应用。 Publisher：Springer。 SIT：Mobile Networks & Applications

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

专知会员服务

33+阅读 · 2020年4月26日

卷积神经网络的概述论文:分析、应用和展望，21页pdf

卷积神经网络的概述论文:分析、应用和展望，21页pdf

专知会员服务

91+阅读 · 2020年4月7日

面向结构化数据的向量嵌入理论 | word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data

面向结构化数据的向量嵌入理论 | word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data

专知会员服务

52+阅读 · 2020年4月1日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

从信息论的角度来理解损失函数

从信息论的角度来理解损失函数

深度学习每日摘要

17+阅读 · 2019年4月7日

腊月廿八 | 强化学习-TRPO和PPO背后的数学

腊月廿八 | 强化学习-TRPO和PPO背后的数学

AI研习社

18+阅读 · 2019年2月2日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

PTGAN for Person Re-Identification

PTGAN for Person Re-Identification

统计学习与视觉计算组

4+阅读 · 2018年9月10日

【深度学习基础】4. Recurrent Neural Networks

【深度学习基础】4. Recurrent Neural Networks

微信AI

16+阅读 · 2017年7月19日

DAG-GNN: DAG Structure Learning with Graph Neural Networks

Arxiv

8+阅读 · 2019年4月22日

Learning When Not to Answer: A Ternary Reward Structure for Reinforcement Learning based Question Answering

Arxiv

6+阅读 · 2019年4月3日

Using Ternary Rewards to Reason over Knowledge Graphs with Deep Reinforcement Learning

Arxiv

3+阅读 · 2019年2月26日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking

Arxiv

3+阅读 · 2018年4月2日

VIP会员

相关主题

注意力网络

变分自编码

相关VIP内容

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

随机特征核近似综述: 算法与理论，Random Features for Kernel Approximation: A Survey in Algorithms, Theory, and Beyond

专知会员服务

33+阅读 · 2020年4月26日

卷积神经网络的概述论文:分析、应用和展望，21页pdf

卷积神经网络的概述论文:分析、应用和展望，21页pdf

专知会员服务

91+阅读 · 2020年4月7日

面向结构化数据的向量嵌入理论 | word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data

面向结构化数据的向量嵌入理论 | word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data

专知会员服务

52+阅读 · 2020年4月1日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

从代码基础模型到智能体与应用：代码智能的全面综述与实践指南

《北约认知战概念报告》

【MIT博士论文】高效的视觉合成生成模型

美海军放弃星座级转而采用国家安全巡逻舰设计

相关资讯

从信息论的角度来理解损失函数

从信息论的角度来理解损失函数

深度学习每日摘要

17+阅读 · 2019年4月7日

腊月廿八 | 强化学习-TRPO和PPO背后的数学

腊月廿八 | 强化学习-TRPO和PPO背后的数学

AI研习社

18+阅读 · 2019年2月2日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

PTGAN for Person Re-Identification

PTGAN for Person Re-Identification

统计学习与视觉计算组

4+阅读 · 2018年9月10日

【深度学习基础】4. Recurrent Neural Networks

【深度学习基础】4. Recurrent Neural Networks

微信AI

16+阅读 · 2017年7月19日

相关论文

DAG-GNN: DAG Structure Learning with Graph Neural Networks

Arxiv

8+阅读 · 2019年4月22日

Learning When Not to Answer: A Ternary Reward Structure for Reinforcement Learning based Question Answering

Arxiv

6+阅读 · 2019年4月3日

Using Ternary Rewards to Reason over Knowledge Graphs with Deep Reinforcement Learning

Arxiv

3+阅读 · 2019年2月26日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking

Arxiv

3+阅读 · 2018年4月2日

大家都在搜

朱克爱德华兹家族

大型语言模型

MIT博士论文

蓝牙安全攻防

从传统方法到深度学习—— bilateral filter 到 HDRNet的演进

微信扫码咨询专知VIP会员