This paper describes important considerations and challenges associated with online reinforcement-learning based waveform selection for target identification in frequency modulated continuous wave (FMCW) automotive radar systems. We present a novel learning approach based on satisficing Thompson sampling, which quickly identifies a waveform expected to yield satisfactory classification performance. We demonstrate through measurement-level simulations that effective waveform selection strategies can be quickly learned, even in cases where the radar must select from a large catalog of candidate waveforms. The radar learns to adaptively select a bandwidth for appropriate resolution and a slow-time unimodular code for interference mitigation in the scene of interest by optimizing an expected classification metric.
翻译:本文介绍了与在频率调制连续波(FMCC)汽车雷达系统中进行目标识别的在线强化学习基于波形选择有关的重要考虑和挑战。我们介绍了一种基于讽刺汤普森取样的新学习方法,该方法迅速发现一种预期能产生令人满意的分类性能的波形。我们通过测量级模拟表明,即使在雷达必须从大型候选波形目录中挑选出有效波形选择战略的情况下,也可以很快地学到有效的波形选择战略。雷达学会通过优化预期的分类指标,适应性地选择带宽,以便适当解析,并采用缓慢的单向单向代码,在感兴趣的地区减少干扰。