Advances in mobile communication capabilities open the door for closer integration of pre-hospital and in-hospital care processes. For example, medical specialists can be enabled to guide on-site paramedics and can, in turn, be supplied with live vitals or visuals. Consolidating such performance-critical applications with the highly complex workings of mobile communications requires solutions both reliable and efficient, yet easy to integrate with existing systems. This paper explores the application of Deep Deterministic Policy Gradient~(\ddpg) methods for learning a communications resource scheduling algorithm with special regards to priority users. Unlike the popular Deep-Q-Network methods, the \ddpg is able to produce continuous-valued output. With light post-processing, the resulting scheduler is able to achieve high performance on a flexible sum-utility goal.
翻译:移动通信能力的进步为院前和院内护理流程的更紧密集成打开了大门。例如,可以启用医学专家来指导现场医护人员,并随之获取实时生命体征或可视化图像。将这样的绩效关键应用程序与移动通信的高度复杂工作融合需要可靠而高效的解决方案,同时易于与现有系统集成。本文探讨了使用深度确定性策略梯度方法_\ddpg学习通信资源调度算法的应用,特别关注优先级用户。与广受欢迎的 Deep-Q Network 方法不同,\ddpg能够产生连续值输出。通过轻微后期处理,产生的调度程序能够在柔性总效用目标上取得高性能。