Light field (LF) imaging has gained significant attention due to its recent success in 3-dimensional (3D) displaying and rendering as well as augmented and virtual reality usage. Nonetheless, because of the two extra dimensions, LFs are much larger than conventional images. We develop a JPEG-assisted learning-based technique to reconstruct an LF from a JPEG bitstream with a bit per pixel ratio of 0.0047 on average. For compression, we keep the LF's center view and use JPEG compression with 50% quality. Our reconstruction pipeline consists of a small JPEG enhancement network (JPEG-Hance), a depth estimation network (Depth-Net), followed by view synthesizing by warping the enhanced center view. Our pipeline is significantly faster than using video compression on pseudo-sequences extracted from an LF, both in compression and decompression, while maintaining effective performance. We show that with a 1% compression time cost and 18x speedup for decompression, our methods reconstructed LFs have better structural similarity index metric (SSIM) and comparable peak signal-to-noise ratio (PSNR) compared to the state-of-the-art video compression techniques used to compress LFs.
翻译:光场( LF) 成像因其最近在3维( 3D) 显示和显示以及扩大和虚拟现实使用方面所取得的成功而得到极大关注。 然而,由于两个额外的维度,LF比常规图像要大得多。我们开发了JPEG辅助学习基础技术,从JPEG位流中重建LF, 平均比重为0.0047。关于压缩,我们保留LF的中心视图,并以50%的质量使用JPEG压缩。我们的重建管道包括一个小型JPEG增强网络(JPEG-Hance),一个深度估计网络(Deph-Net),然后通过扭曲增强的中心视图合成视图。我们的管道比在压缩和减压中从LF提取的假结果上使用视频压缩速度快得多,同时保持有效的性能。我们显示,在压缩1 %的时间成本和18x加速速度的情况下,我们重建的LFs的方法结构上更相似的指数度指标(SSIM- HID- ) 和与所使用的最高峰信号- RFS- 对比。