Attention mechanisms have become crucially important in deep learning in recent years. These non-local operations, which are similar to traditional patch-based methods in image processing, complement local convolutions. However, computing the full attention matrix is an expensive operation, with a heavy memory and computational load. These limitations constrain network architectures and performance, in particular in the case of high-resolution images. We propose an efficient attention layer based on the stochastic algorithm PatchMatch, which is used to determine approximate nearest neighbors. We refer to our proposed layer as a "Patch-based Stochastic Attention Layer" (PSAL). Furthermore, we propose different approaches, based on patch aggregation, to ensure the differentiability of PSAL, thus allowing end-to-end training of any network containing our layer. PSAL has a small memory footprint and can therefore scale to high-resolution images. It maintains this footprint without sacrificing the spatial precision or global reach of the nearest neighbors, which means that it can be easily inserted at any level of a deep architecture, even in shallower levels. We demonstrate the usefulness of PSAL on several image editing tasks, such as image inpainting and image colorization.
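The contrast motivating PSAL can be sketched in a few lines: full attention stores an N×N similarity matrix, whereas a PatchMatch-style search keeps only one approximate nearest-neighbor index per query, i.e. O(N) memory. The toy function below is a simplified sketch of the random-search step of PatchMatch (the real algorithm also propagates good matches between spatially adjacent patches, which is omitted here); the function names and parameters are illustrative, not the paper's actual implementation.

```python
import numpy as np

def full_attention_memory(n):
    # A full attention layer materializes an n x n similarity matrix.
    return n * n

def patchmatch_nn(queries, keys, iters=4, seed=0):
    """Toy PatchMatch-style search: random initialization followed by
    random-search refinement. Returns one approximate nearest-neighbor
    key index per query, using O(n) memory instead of O(n^2)."""
    rng = np.random.default_rng(seed)
    n = len(queries)
    nn = rng.integers(0, len(keys), size=n)            # random initialization
    best = np.sum((queries - keys[nn]) ** 2, axis=1)   # current squared distances
    for _ in range(iters):
        cand = rng.integers(0, len(keys), size=n)      # random-search candidates
        d = np.sum((queries - keys[cand]) ** 2, axis=1)
        improve = d < best                             # keep candidates that improve
        nn[improve] = cand[improve]
        best[improve] = d[improve]
    return nn
```

Because each iteration touches only O(n) candidate distances, the memory footprint stays linear in the number of patches, which is what allows this style of attention to scale to high-resolution inputs.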