Timely and reliable environment perception is fundamental to safe and efficient automated driving. However, the perception of standalone intelligence inevitably suffers from occlusions. A new paradigm, Cooperative Perception (CP), comes to the rescue by sharing sensor data from another perspective, i.e., from a cooperative vehicle (CoV). Due to the limited communication bandwidth, it is essential to schedule the most beneficial CoV, considering both the viewpoints and communication quality. Existing methods rely on the exchange of meta-information, such as visibility maps, to predict the perception gains from nearby vehicles, which induces extra communication and processing overhead. In this paper, we propose a new approach, learning while scheduling, for distributed scheduling of CP. The solution enables CoVs to predict the perception gains using past observations, leveraging the temporal continuity of perception gains. Specifically, we design a mobility-aware sensor scheduling (MASS) algorithm based on the restless multi-armed bandit (RMAB) theory to maximize the expected average perception gain. An upper bound on the expected average learning regret is proved, which matches the lower bound of any online algorithm up to a logarithmic factor. Extensive simulations are carried out on realistic traffic traces. The results show that the proposed MASS algorithm achieves the best average perception gain and improves recall by up to 4.2 percentage points compared to other learning-based algorithms. Finally, a case study on a trace of LiDAR frames qualitatively demonstrates the superiority of adaptive exploration, the key element of the MASS algorithm.
翻译:安全、高效的自动化驾驶是安全、高效自动驾驶的基本条件,但是,对独立情报的认识是及时、可靠的环境感知的基础。然而,对独立情报的认识不可避免地受到排斥的影响。一个新的范式,即合作感知(CP),是从另一个角度共享感官数据,从另一个角度,即合作性飞行器(CoV)的感知(CoV)的感知)到救援。由于通信带宽有限,有必要考虑到观点和通信质量,安排最有益的CoV。现有方法依靠交换元信息,如可见度地图,预测附近车辆的感知收益,这会引起额外的通信和处理间接费用。在本文中,我们提出了一种新的方法,即合作感知(CP),即合作感知(CP)从另一个角度共享感官数据。由于通信带宽宽宽广的带宽宽的带宽的带宽的带宽,因此有必要考虑到最有利的多臂带宽的带宽的带宽的带宽的带宽的带宽的带宽,现有方法可以用来预测附近车辆的感知益,从而产生额外的交流和处理。我们建议,在分配CPRLA值矩阵的定时,通过以往的内程内程内程内程上可以预测其他的感知觉识。</s>