While inferring common actor states (such as position or velocity) is an important and well-explored task of the perception system aboard a self-driving vehicle (SDV), it may not always provide sufficient information to the SDV. This is especially true in the case of active emergency vehicles (EVs), where light-based signals also need to be captured to provide a full context. We consider this problem and propose a sequential methodology for the detection of active EVs, using an off-the-shelf CNN model operating at a frame level and a downstream smoother that accounts for the temporal aspect of flashing EV lights. We also explore model improvements through data augmentation and training with additional hard samples.
翻译:虽然推断通用行为者状态(如位置或速度)是自驾驶车(SDV)上感知系统的一项重要和探索周密的任务,但它可能并不总是向SDV提供足够的信息。对于活动应急车(EVs)来说尤其如此,因为光基信号也需要捕捉,以提供一个完整的背景。我们考虑这一问题,并提出探测活性EV的顺序方法,使用现成CNN模型,在框架一级运行,下游光滑,说明闪光EV灯的时间方面。我们还探索通过数据扩增和培训更多硬样品来改进模型。