We propose a lightweight, single image super-resolution network for mobile devices, named XCAT. XCAT introduces Heterogeneous Group Convolution Blocks with Cross Concatenations (HXBlock). The heterogeneous split of the input channels to the group convolution blocks reduces the number of operations, and cross concatenation allows for information flow between the intermediate input tensors of cascaded HXBlocks. Cross concatenations inside HXBlocks can also avoid using more expensive operations like 1x1 convolutions. To further prev ent expensive tensor copy operations, XCAT utilizes non-trainable convolution kernels to apply up sampling operations. Designed with integer quantization in mind, XCAT also utilizes several techniques on training, like intensity-based data augmentation. Integer quantized XCAT operates in real time on Mali-G71 MP2 GPU with 320ms, and on Synaptics Dolphin NPU with 30ms (NCHW) and 8.8ms (NHWC), suitable for real-time applications.
翻译:我们提议为移动设备建立一个轻量、单一图像超分辨率网络,名为XCAT。XCAT引入了具有交叉连接的异质集团组合区块(HXBlock) 。 向集团组合区块输入的渠道的多元分解减少了操作数量, 交叉共解允许串联HXlocks的中间输入阀门之间的信息流动。 在HXlocks内部的交叉连接也可以避免使用更昂贵的操作, 如 1x1 组合。 为了进一步预设昂贵的高压拷贝操作, XCAT使用非可控的组合内核来应用取样操作。 XCAT还利用了几种培训技术, 如基于强度的数据增强。 Integer 定量XCAT实时操作马里-G71 MP2 GPU, 320米, 以及配有30米和8.8米的合成多芬 NPUPU(NHWC), 适合实时应用。