The disaggregation of base stations into discrete RAN functions introduces new threats to mobile networks, as failures in one RAN function can trigger cascading failures and disrupt the entire functional chain, impacting network performance and leading to outages. In this paper, we propose the first resilience mechanism leveraging the adaptive placement of RAN functions to mitigate disruptions and recover service continuity in the presence of compromised infrastructure. Our model detects disrupted RUs due to cascading failures, reacts by re-instantiating CU and DU in alternative cloud locations, and recovers service continuity by reestablishing functional chains. We formulate this recovery process as an optimization problem that maximizes post-failure network performance while considering computational and communication constraints of the infrastructure. We numerically evaluated our approach on a real-world mobile network topology under multiple failure scenarios, and demonstrated that our solution recovers up to 70% higher throughput compared to conventional resilience mechanisms.
翻译:基站解耦为离散的RAN功能为移动网络引入了新的威胁,因为单个RAN功能的故障可能引发级联失效,进而破坏整个功能链,影响网络性能并导致服务中断。本文提出了首个利用RAN功能自适应部署的弹性机制,以在基础设施受损时缓解中断并恢复服务连续性。我们的模型能够检测因级联失效而中断的RU,通过在备用云位置重新实例化CU和DU进行响应,并通过重建功能链恢复服务连续性。我们将此恢复过程建模为一个优化问题,在考虑基础设施计算与通信约束的同时,最大化故障后的网络性能。我们在真实移动网络拓扑下针对多种故障场景进行了数值评估,结果表明与传统弹性机制相比,我们的方案能够恢复高达70%的吞吐量提升。