In this paper, we propose a wavelet-based video codec specifically designed for VR displays that enables real-time playback of high-resolution 360{\deg} videos. Our codec exploits the fact that only a fraction of the full 360{\deg} video frame is visible on the display at any time. To load and decode the video viewport-dependently in real time, we make use of the wavelet transform for intra- as well as inter-frame coding. Thereby, the relevant content is directly streamed from the drive, without the need to hold the entire frames in memory. With an average of 193 frames per second at 8192x8192-pixel full-frame resolution, the conducted evaluation demonstrates that our codec's decoding performance is up to 272% higher than that of the state-of-the-art video codecs H.265 and AV1 for typical VR displays. By means of a perceptual study, we further illustrate the necessity of high frame rates for a better VR experience. Finally, we demonstrate how our wavelet-based codec can also directly be used in conjunction with foveation for further performance increase.
翻译:在本文中, 我们建议为 VR 显示器专门设计一个波盘式视频编码器, 使高分辨率 360\ deg} 视频能够实时重播。 我们的编码器利用了一个事实, 即整个360\ deg} 视频框架中只有部分能随时在显示器上看到。 要实时地装载和解码视频屏幕, 我们使用波盘转换功能进行内部和内部编码。 因此, 相关内容直接从驱动器中流出, 不需要保存整个框架的记忆。 我们进行的编码器平均每秒193个框架, 平均为 892x8192- pixel 全文分辨率, 所进行的评价显示我们的编码器解码性能比高级视频编码器H. 265 和 AV1 的性能高272%, 用于典型 VR 显示器。 我们通过一种概念性研究, 进一步说明改进 VR 经验的高框架率的必要性。 最后, 我们演示我们的波盘调调调码器如何直接用于进一步提高性能。