MEC 辅助VR录像服务公司能源-长期权衡的混合政策学习 (Hybrid Policy Learning for Energy-Latency Tradeoff in MEC-Assisted VR Video Service)

Virtual reality (VR) is promising to fundamentally transform a broad spectrum of industry sectors and the way humans interact with virtual content. However, despite unprecedented progress, current networking and computing infrastructures are incompetent to unlock VR's full potential. In this paper, we consider delivering the wireless multi-tile VR video service over a mobile edge computing (MEC) network. The primary goal is to minimize the system latency/energy consumption and to arrive at a tradeoff thereof. To this end, we first cast the time-varying view popularity as a model-free Markov chain to effectively capture its dynamic characteristics. After jointly assessing the caching and computing capacities on both the MEC server and the VR playback device, a hybrid policy is then implemented to coordinate the dynamic caching replacement and the deterministic offloading, so as to fully utilize the system resources. The underlying multi-objective problem is reformulated as a partially observable Markov decision process, and a deep deterministic policy gradient algorithm is proposed to iteratively learn its solution, where a long short-term memory neural network is embedded to continuously predict the dynamics of the unobservable popularity. Simulation results demonstrate the superiority of the proposed scheme in achieving a trade-off between the energy efficiency and the latency reduction over the baseline methods.

翻译：虚拟现实(VR)有望从根本上改变广泛的产业部门和人类与虚拟内容互动的方式。然而,尽管取得了前所未有的进步,但目前的联网和计算基础设施无法完全释放VR的全部潜力。在本文件中,我们考虑通过移动边缘计算(MEC)网络提供无线多盘VR视频服务。主要目标是尽量减少系统延缓度/能源消耗,并实现平衡。为此,我们首先将时间变化的受欢迎度作为无模型的Markov链条,以有效捕捉其动态特征。在联合评估MEC服务器和VR回放装置的缓冲和计算能力之后,将实施混合政策,以协调动态缓冲替换和确定性卸载功能,从而充分利用系统资源。潜在的多目标问题被重新改写为部分可观测的Markov决策程序,并提议深层的确定性政策梯度算法,以迭代学习其解决方案,其中长期的短期记忆神经网络将嵌入在内,以持续预测无法观测的贸易效率降低的基线方法之间的动态。

相关内容

关注 23

IEEE虚拟现实会议一直是展示虚拟现实(VR)广泛领域研究成果的主要国际场所，包括增强现实（AR），混合现实（MR）和3D用户界面中寻求高质量的原创论文。每篇论文应归类为主要涵盖研究，应用程序或系统，并使用以下准则进行分类：研究论文应描述有助于先进软件，硬件，算法，交互或人为因素发展的结果。应用论文应解释作者如何基于现有思想并将其应用到以新颖的方式解决有趣的问题。每篇论文都应包括对给定应用领域中VR/AR/MR使用成功的评估。官网地址：http://dblp.uni-trier.de/db/conf/vr/

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

112+阅读 · 2020年5月15日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日