Convolutional neural networks (CNNs) have been not only widespread but also achieved noticeable results on numerous applications including image classification, restoration, and generation. Although the weight-sharing property of convolutions makes them widely adopted in various tasks, its content-agnostic characteristic can also be considered a major drawback. To solve this problem, in this paper, we propose a novel operation, called pixel adaptive kernel attention (PAKA). PAKA provides directivity to the filter weights by multiplying spatially varying attention from learnable features. The proposed method infers pixel-adaptive attention maps along the channel and spatial directions separately to address the decomposed model with fewer parameters. Our method is trainable in an end-to-end manner and applicable to any CNN-based models. In addition, we propose an improved information aggregation module with PAKA, called the hierarchical PAKA module (HPM). We demonstrate the superiority of our HPM by presenting state-of-the-art performance on semantic segmentation compared to the conventional information aggregation modules. We validate the proposed method through additional ablation studies and visualizing the effect of PAKA providing directivity to the weights of convolutions. We also show the generalizability of the proposed method by applying it to multi-modal tasks especially color-guided depth map super-resolution.
翻译:革命神经网络(CNNs)不仅非常广泛,而且在包括图像分类、恢复和生成在内的众多应用方面取得了显著成果。虽然分重力特性使革命的分量特性在各种任务中被广泛采用,但其内容不可知特性也可被视为一个重大退步。为了解决这个问题,我们在本文件中提议了一个新的操作,称为像素适应内核注意(PAKA) 。PAKA通过从可学习的特性中增加不同空间的注意,为过滤重量提供了直接性。拟议的方法将频道和空间方向上的像素适应性关注地图分开,分别用来用较少的参数处理分解模式。我们的方法可以以端到端的方式加以训练,并适用于任何CNN的模型。此外,我们提议与PAKA一起改进信息汇总模块,称为PAKA的等级模块(HPM)。我们展示了我们的HPM的优越性,在语义分解与常规信息组合模块相比,我们通过进一步的互换式研究来验证拟议的方法,特别是将PAKA的超度的可理解性分析性,我们又通过显示多度的可变性分析方法,从而展示了PAKAKA的多度的可变制方法。