Nowadays, online screen sharing and remote cooperation are becoming ubiquitous. However, the screen content may be downsampled and compressed during transmission, while it may be displayed on large screens or the users would zoom in for detail observation at the receiver side. Therefore, developing a strong and effective screen content image (SCI) super-resolution (SR) method is demanded. We observe that the weight-sharing upsampler (such as deconvolution or pixel shuffle) could be harmful to sharp and thin edges in SCIs, and the fixed scale upsampler makes it inflexible to fit screens with various sizes. To solve this problem, we propose an implicit transformer network for continuous SCI SR (termed as ITSRN++). Specifically, we propose a modulation based transformer as the upsampler, which modulates the pixel features in discrete space via a periodic nonlinear function to generate features for continuous pixels. To enhance the extracted features, we further propose an enhanced transformer as the feature extraction backbone, where convolution and attention branches are utilized parallelly. Besides, we construct a large scale SCI2K dataset to facilitate the research on SCI SR. Experimental results on nine datasets demonstrate that the proposed method achieves state-of-the-art performance for SCI SR (outperforming SwinIR by 0.74 dB for x3 SR) and also works well for natural image SR. Our codes and dataset will be released upon the acceptance of this work.
翻译:目前,在线屏幕共享和远程合作正在变得无处不在。 然而, 屏幕内容可能会在传输过程中被下印和压缩, 而屏幕内容可能会在大屏幕上显示, 并且可以在大屏幕上显示, 或者用户会放大以在接收方进行详细观察。 因此, 需要开发一个强大而有效的屏幕内容图像( SCI) 超分辨率( SR) 方法 。 我们观察到, 权重共享高温( 如分流或像素打杂) 可能会对 STIS 的锐利和薄色边缘有害, 而固定的缩放器会使屏幕无法适应不同尺寸的屏幕。 为了解决这个问题, 我们提议为连续的 SCI SR 建立隐含的变压器网络( 称为 ITSRN+++ ) 。 具体地说, 我们提议以调制成一个基于调制式的变压器, 通过定期的非线性能函数调节离散空间的像素特性, 以生成连续的像素。 为了增强我们提取的特性, 我们进一步提议一个增强变压器作为功能的立式图像主干,, 用于SCI 和关注 SSR 的SSR 的SL 的S- 的S- 的SL 工作将实现 的自然进度 。