The training of a next-best-view (NBV) planner for visual place recognition (VPR) is a fundamentally important task in autonomous robot navigation, for which a typical approach is to use visual experiences collected in the target domain as training data. However, collecting a wide variety of visual experiences during everyday navigation is costly and prohibitive for real-time robotic applications. We address this issue with a novel {\it domain-invariant} NBV planner. We assume that a standard VPR subsystem based on a convolutional neural network (CNN) is available, and propose to transfer its domain-invariant state recognition ability to train the domain-invariant NBV planner. Specifically, we divide the visual cues available from the CNN model into two types: the output layer cue (OLC) and the intermediate layer cue (ILC). The OLC is available at the output layer of the CNN model and provides an estimate of the robot state (e.g., the robot viewpoint) with respect to the world-centric view coordinate system. The ILC is available within the intermediate layers of the CNN model as a high-level description of the visual content (e.g., a saliency image) with respect to the ego-centric view. In our framework, the ILC and OLC are mapped to a state vector and subsequently used to train a multiview NBV planner via deep reinforcement learning. Experiments on the public NCLT dataset validate the effectiveness of the proposed method.
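To make the cue-fusion step concrete, the following is a minimal sketch, assuming a PyTorch implementation: a stand-in CNN backbone whose output logits play the role of the OLC and whose pooled intermediate activations play the role of the ILC, concatenated into a state vector for a DQN-style view-selection network. The backbone architecture, the layer chosen as the ILC, and all dimensions (\texttt{NUM\_PLACES}, \texttt{NUM\_ACTIONS}) are illustrative assumptions, not details from the paper.

\begin{verbatim}
# Minimal sketch (not the authors' implementation) of fusing an
# output-layer cue (OLC) and an intermediate-layer cue (ILC) from a
# CNN-based VPR model into a state vector for an NBV planner.
import torch
import torch.nn as nn

NUM_PLACES = 10   # assumed number of place classes in the VPR model
NUM_ACTIONS = 8   # assumed number of candidate next views

class VPRNet(nn.Module):
    """Stand-in VPR backbone exposing both cues."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.classifier = nn.Linear(32, NUM_PLACES)

    def forward(self, x):
        feat = self.features(x)            # intermediate activations
        ilc = self.pool(feat).flatten(1)   # ILC: ego-centric content cue
        olc = self.classifier(ilc)         # OLC: world-centric state estimate
        return olc, ilc

# Q-network over the fused OLC+ILC state vector (deep RL head).
q_net = nn.Sequential(
    nn.Linear(NUM_PLACES + 32, 64), nn.ReLU(),
    nn.Linear(64, NUM_ACTIONS),
)

vpr = VPRNet().eval()
image = torch.randn(1, 3, 64, 64)          # dummy ego-centric view
with torch.no_grad():
    olc, ilc = vpr(image)
state = torch.cat([olc.softmax(dim=1), ilc], dim=1)  # fused state vector
action = q_net(state).argmax(dim=1)        # greedy next-best-view choice
print("selected view index:", action.item())
\end{verbatim}

In a full training loop, \texttt{q\_net} would be optimized with a standard deep reinforcement learning objective (e.g., temporal-difference loss over episodes of view selections), with the VPR backbone kept frozen so that its domain-invariant cues are transferred rather than retrained.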