In tracking radar, the sensing environment often varies significantly over a track duration due to the target's trajectory and dynamic interference. Adapting the radar's waveform using partial information about the state of the scene has been shown to provide performance benefits in many practical scenarios. Moreover, radar measurements generally exhibit strong temporal correlation, allowing memory-based learning algorithms to effectively learn waveform selection strategies. This work examines a radar system which builds a compressed model of the radar-environment interface in the form of a context-tree. The radar uses this context tree-based model to select waveforms in a signal-dependent target channel, which may respond adversarially to the radar's strategy. This approach is guaranteed to asymptotically converge to the average-cost optimal policy for any stationary target channel that can be represented as a Markov process of order U < $\infty$, where the constant U is unknown to the radar. The proposed approach is tested in a simulation study, and is shown to provide tracking performance improvements over two state-of-the-art waveform selection schemes.
翻译:在跟踪雷达方面,由于目标的轨迹和动态干扰,在跟踪期内,感测环境往往差异很大。利用部分关于场景状况的信息对雷达波形进行调适,在许多实际情景中都显示能够带来性能效益。此外,雷达测量通常显示出很强的时间相关性,使基于记忆的学习算法能够有效地学习波形选择战略。这项工作审查了一个雷达系统,该系统以上下文树的形式建立雷达与环境界面的压缩模型。雷达利用这个基于背景的树型模型,在依赖信号的目标频道中选择波形,这种波形可能与雷达的战略对立。这一方法保证了任何固定目标频道的普通成本最佳政策无一例外地趋同,这种固定目标频道的顺序是U < $\ inty$,因为雷达对恒定值不知情。在模拟研究中测试了拟议的方法,并显示可以跟踪两个最先进的波形选择方案的性能改进情况。