In Joint Communication and Radar (JCR)-based Autonomous Vehicle (AV) systems, optimizing waveform structure is one of the most challenging tasks due to strong influences between radar and data communication functions. Specifically, the preamble of a data communication frame is typically leveraged for the radar function. As such, the higher number of preambles in a Coherent Processing Interval (CPI) is, the greater radar's performance is. In contrast, communication efficiency decreases as the number of preambles increases. Moreover, AVs' surrounding radio environments are usually dynamic with high uncertainties due to their high mobility, making the JCR's waveform optimization problem even more challenging. To that end, this paper develops a novel JCR framework based on the Markov decision process framework and recent advanced techniques in deep reinforcement learning. By doing so, without requiring complete knowledge of the surrounding environment in advance, the JCR-AV can adaptively optimize its waveform structure (i.e., number of frames in the CPI) to maximize radar and data communication performance under the surrounding environment's dynamic and uncertainty. Extensive simulations show that our proposed approach can improve the joint communication and radar performance up to 46.26% compared with those of conventional methods (e.g., greedy policy- and fixed waveform-based approaches).
翻译:在基于联合通信和雷达(JCR)的基于联合通信和自动飞行器(AV)系统中,优化波形结构由于雷达和数据通信功能之间的强大影响而成为最具挑战性的任务之一。具体地说,数据通信框架的序言通常是用于雷达功能的杠杆,因此,协调处理跨Val(CPI)的序言数量较多,因此雷达的性能更高。相反,通信效率随着序言数量的增加而下降。此外,AV的周围无线电环境通常具有动态性,由于其高度流动性而具有高度不确定性,使JCR的波形优化问题更加具有挑战性。为此,本文件根据Markov决定程序框架和最近的深入强化学习先进技术,开发了一个全新的JCR框架。通过这样做,在不要求事先充分了解周围环境的情况下,JCR-AV可以适应性地优化其波形结构(即CPI的框架数目),以便在周围环境的动态和不确定性下最大限度地提高雷达和数据通信绩效。广泛的模拟表明,我们拟议的办法可以改进联合通信和数据通信绩效,将常规汇率方法提高到46.6%,而常规汇率方法则比较了46.