Object detection and single image super-resolution are classic problems in computer vision (CV). The object detection task aims to recognize the objects in input images, while the image restoration task aims to reconstruct high quality images from given low quality images. In this paper, a two-stage framework for object detection and image restoration is proposed. The first stage uses YOLO series algorithms to complete the object detection and then performs image cropping. In the second stage, this work improves Swin Transformer and uses the new proposed algorithm to connect the Swin Transformer layer to design a new neural network architecture. We name the newly proposed network for image restoration SwinOIR. This work compares the model performance of different versions of YOLO detection algorithms on MS COCO dataset and Pascal VOC dataset, demonstrating the suitability of different YOLO network models for the first stage of the framework in different scenarios. For image super-resolution task, it compares the model performance of using different methods of connecting Swin Transformer layers and design different sizes of SwinOIR for use in different life scenarios. Our implementation code is released at https://github.com/Rubbbbbbbbby/SwinOIR.
翻译:对象探测任务旨在识别输入图像中的物体,而图像恢复任务则旨在从给定的低质量图像中重建高质量的图像。在本文中,提出了一个用于天体探测和图像恢复的两阶段框架。第一阶段使用YOLO系列算法完成天体探测,然后进行图像裁剪。在第二阶段,这项工作改进了Swin变换器,并使用新的拟议算法将Swin变换器层连接起来,设计一个新的神经网络结构。我们命名了新提议的图像恢复 SwinOIR 网络。这项工作比较了在 MS COCO 数据集和 Pascal VOC 数据集上不同版本的 YOLO 探测算法模型的模型性能,以显示不同情景下框架第一阶段的不同 YOLO 网络模型是否适合。对于图像超分辨率任务,它比较了使用不同方法连接 Swin变换器层和设计不同尺寸的 SwinOIR 网络结构设计用于不同生命情景的模型性能。我们的执行代码在 https://githubrubbbbbbbbby 上发布。</s>