We study lifelong visual perception in an embodied setup, where we develop new models and compare various agents that navigate in buildings and occasionally request annotations which, in turn, are used to refine their visual perception capabilities. The purpose of the agents is to recognize objects and other semantic classes in the whole building at the end of a process that combines exploration and active visual learning. As we study this task in a lifelong learning context, the agents should use knowledge gained in earlier visited environments in order to guide their exploration and active learning strategy in successively visited buildings. We use the semantic segmentation performance as a proxy for general visual perception and study this novel task for several exploration and annotation methods, ranging from frontier exploration baselines which use heuristic active learning, to a fully learnable approach. For the latter, we introduce a deep reinforcement learning (RL) based agent which jointly learns both navigation and active learning. A point goal navigation formulation, coupled with a global planner which supplies goals, is integrated into the RL model in order to provide further incentives for systematic exploration of novel scenes. By performing extensive experiments on the Matterport3D dataset, we show how the proposed agents can utilize knowledge from previously explored scenes when exploring new ones, e.g. through less granular exploration and less frequent requests for annotations. The results also suggest that a learning-based agent is able to use its prior visual knowledge more effectively than heuristic alternatives.
翻译:我们在一个包含式设置中研究终身视觉认识,我们开发新的模型,比较在建筑物中航行的各种物剂,偶尔要求说明,用来改进视觉认识能力;这些物剂的目的是在将探索与积极视觉学习相结合的过程结束时,在整个建筑中辨别物体和其他语义类;当我们在终身学习的背景下研究这项任务时,这些物剂应利用在以前访问过的环境中获得的知识,以指导其在相继访问过的建筑物中的探索和积极学习战略;我们利用语义分化性表现作为一般视觉感知的代用品,并研究这一新颖的勘探和批注方法,从利用超常积极学习的前沿勘探基线到完全可学习的方法;对于后者,我们采用深强化学习(RL)基于的物剂,共同学习导航和积极学习。一个点目标导航配有提供目标的全球规划器,被纳入RL模型,以进一步激励系统探索新场景。我们通过在MTERP3D数据集上进行广泛的实验和研究,我们展示了从远端探索性积极学习的物义学成果,从先前探索不那么频繁的物证,我们又建议如何利用新的探索先见性研究。