We tackle the problem of joint frequency and power allocation while emphasizing the generalization capability of a deep reinforcement learning model. Most of the existing methods solve reinforcement learning-based wireless problems for a specific pre-determined wireless network scenario. The performance of a trained agent tends to be very specific to the network and deteriorates when used in a different network operating scenario (e.g., different in size, neighborhood, and mobility, among others). We demonstrate our approach to enhance training to enable a higher generalization capability during inference of the deployed model in a distributed multi-agent setting in a hostile jamming environment. With all these, we show the improved training and inference performance of the proposed methods when tested on previously unseen simulated wireless networks of different sizes and architectures. More importantly, to prove practical impact, the end-to-end solution was implemented on the embedded software-defined radio and validated using over-the-air evaluation.
翻译:我们处理联合频率和电力分配问题,同时强调深度强化学习模式的普及能力。大多数现有方法解决了特定预先确定的无线网络情景的强化学习型无线问题。受过训练的代理的性能往往对网络非常特殊,在不同的网络操作情景(例如,不同大小、相邻和机动性等)中使用时会恶化。我们展示了我们加强培训的方法,以便在一个充满敌意的干扰环境中,在分布式多试剂环境中,在推断部署的模型时,提高一般化能力。所有这些方法都显示了在对以前不见的、规模和结构不同的模拟无线网络进行测试时,拟议方法的培训和推论效果得到改善。更重要的是,为了证明实际效果,终端到终端解决方案是在嵌入软件定义的无线电上实施,并使用超空评价加以验证。