With the emerging of 360-degree image/video, augmented reality (AR) and virtual reality (VR), the demand for analysing and processing spherical signals get tremendous increase. However, plenty of effort paid on planar signals that projected from spherical signals, which leading to some problems, e.g. waste of pixels, distortion. Recent advances in spherical CNN have opened up the possibility of directly analysing spherical signals. However, they pay attention to the full mesh which makes it infeasible to deal with situations in real-world application due to the extremely large bandwidth requirement. To address the bandwidth waste problem associated with 360-degree video streaming and save computation, we exploit Focused Icosahedral Mesh to represent a small area and construct matrices to rotate spherical content to the focused mesh area. We also proposed a novel VertexShuffle operation that can significantly improve both the performance and the efficiency compared to the original MeshConv Transpose operation introduced in UGSCNN. We further apply our proposed methods on super resolution model, which is the first to propose a spherical super-resolution model that directly operates on a mesh representation of spherical pixels of 360-degree data. To evaluate our model, we also collect a set of high-resolution 360-degree videos to generate a spherical image dataset. Our experiments indicate that our proposed spherical super-resolution model achieves significant benefits in terms of both performance and inference time compared to the baseline spherical super-resolution model that uses the simple MeshConv Transpose operation. In summary, our model achieves great super-resolution performance on 360-degree inputs, achieving 32.79 dB PSNR on average when super-resoluting 16x vertices on the mesh.
翻译:随着360度图像/视频的出现,现实(AR)和虚拟现实(VR)的扩大,分析和处理球状信号的需求大大增加。然而,对球状信号的预测导致一些问题,例如像素的浪费、扭曲。球状CNN的最近进步为直接分析球状信号开辟了可能性。然而,它们关注全网格,这使得无法处理现实世界应用程序中的情况,因为需要的带宽非常大。为了解决与360度视频流和节省计算有关的带宽浪费问题,我们利用Focus Icosahedal Mesh等球状信号来代表一个小区域,并构建矩阵,将球状内容旋转到焦点网状区域。我们还提议了一个新的螺旋外线外运动操作,可以大大改善性能和效率,与UGSCNN推出的原始Mescial模型Psion Transport操作。我们进一步将拟议的方法运用在超级分辨率模型上,这是第一个提议在Scloral-Sloral 超级分辨率的图像中进行一个Sloral-social Slex Slex Slex 模型的运行,从而在高度数据中直接生成数据模型上生成。