在动态环境中不断优化无线资源学习:双层优化观点 (Learning to Continuously Optimize Wireless Resource in a Dynamic Environment: A Bilevel Optimization Perspective)

There has been a growing interest in developing data-driven, and in particular deep neural network (DNN) based methods for modern communication tasks. For a few popular tasks such as power control, beamforming, and MIMO detection, these methods achieve state-of-the-art performance while requiring less computational efforts, less resources for acquiring channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment. This work develops a new approach that enables data-driven methods to continuously learn and optimize resource allocation strategies in a dynamic environment. Specifically, we consider an ``episodically dynamic" setting where the environment statistics change in ``episodes", and in each episode the environment is stationary. We propose to build the notion of continual learning (CL) into wireless system design, so that the learning model can incrementally adapt to the new episodes, {\it without forgetting} knowledge learned from the previous episodes. Our design is based on a novel bilevel optimization formulation which ensures certain ``fairness" across different data samples. We demonstrate the effectiveness of the CL approach by integrating it with two popular DNN based models for power control and beamforming, respectively, and testing using both synthetic and ray-tracing based data sets. These numerical results show that the proposed CL approach is not only able to adapt to the new scenarios quickly and seamlessly, but importantly, it also maintains high performance over the previously encountered scenarios as well.

翻译：开发基于数据驱动的现代通信任务,特别是深神经网络(DNN)方法的兴趣日益浓厚。对于一些流行的任务,如电源控制、波束成形和MIMO探测等,这些方法达到最先进的性能,同时要求较少计算努力,较少获得频道状态信息的资源等。然而,这些方法往往具有挑战性,在动态环境中学习这些方法往往具有挑战性。这项工作开发出一种新的方法,使数据驱动方法能够在动态环境中不断学习和优化资源分配战略。具体地说,我们考虑一种“极端动态的动态”设置,环境统计在“线条”中发生变化,在每种情况下环境都是静止的。我们提议将持续学习的概念纳入无线系统设计,这样学习模式可以逐步适应新情况,在不忘却前一时期所学到的知识。我们的设计基于一个新的双层优化公式,确保在不同数据样本中保持某种“公平性”。我们通过将CL方法与两个基于大众的、基于快速数据控制模式的快速调整模式结合起来,我们展示了CL方法的有效性。我们通过将CL方法与两个基于合成的快速测试和基于数字的数据模型分别显示基于的快速测试的快速测试结果,我们所拟议的快速地展示了该方法,我们的设计是将其转化为的快速调整为基于基于的快速的快速测试和数字的数据。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

专知会员服务

39+阅读 · 2020年11月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

深度强化学习策略梯度教程，53页ppt