Virtual reality (VR) is promising to fundamentally transform a broad spectrum of industry sectors and the way humans interact with virtual content. However, despite unprecedented progress, current networking and computing infrastructures are incompetent to unlock VR's full potential. In this paper, we consider delivering the wireless multi-tile VR video service over a mobile edge computing (MEC) network. The primary goal is to minimize the system latency/energy consumption and to arrive at a tradeoff thereof. To this end, we first cast the time-varying view popularity as a model-free Markov chain to effectively capture its dynamic characteristics. After jointly assessing the caching and computing capacities on both the MEC server and the VR playback device, a hybrid policy is then implemented to coordinate the dynamic caching replacement and the deterministic offloading, so as to fully utilize the system resources. The underlying multi-objective problem is reformulated as a partially observable Markov decision process, and a deep deterministic policy gradient algorithm is proposed to iteratively learn its solution, where a long short-term memory neural network is embedded to continuously predict the dynamics of the unobservable popularity. Simulation results demonstrate the superiority of the proposed scheme in achieving a trade-off between the energy efficiency and the latency reduction over the baseline methods.
翻译:虚拟现实(VR)有望从根本上改变广泛的产业部门和人类与虚拟内容互动的方式。然而,尽管取得了前所未有的进步,但目前的联网和计算基础设施无法完全释放VR的全部潜力。在本文件中,我们考虑通过移动边缘计算(MEC)网络提供无线多盘VR视频服务。主要目标是尽量减少系统延缓度/能源消耗,并实现平衡。为此,我们首先将时间变化的受欢迎度作为无模型的Markov链条,以有效捕捉其动态特征。在联合评估MEC服务器和VR回放装置的缓冲和计算能力之后,将实施混合政策,以协调动态缓冲替换和确定性卸载功能,从而充分利用系统资源。潜在的多目标问题被重新改写为部分可观测的Markov决策程序,并提议深层的确定性政策梯度算法,以迭代学习其解决方案,其中长期的短期记忆神经网络将嵌入在内,以持续预测无法观测的贸易效率降低的基线方法之间的动态。