Monocular person following (MPF) is a capability that supports many useful applications of a mobile robot. However, existing MPF solutions are not completely satisfactory. Firstly, they often fail to track the target at a close distance either because they are based on a visual servo or they need the observation of the full body by the robot. Secondly, their target Re-IDentification (Re-ID) abilities are weak in cases of target appearance change and highly similar appearance of distracting people. To remove the assumption of full-body observation, we propose a width-based tracking module, which relies on the target width, which can be observed even at a close distance. For handling issues related to appearance variation, we use a global CNN (convolutional neural network) descriptor to represent the target and a ridge regression model to learn a target appearance model online. We adopt a sampling strategy for online classifier learning, in which both long-term and short-term samples are involved. We evaluate our method in two datasets including a public person following dataset and a custom-built one with challenging target appearance and target distance. Our method achieves state-of-the-art (SOTA) results on both datasets. For the benefit of the community, we make public the dataset and the source code.
翻译:跟踪(MPF)的单体人是一种支持移动机器人许多有用应用的能力。 但是,现有的 MPF 解决方案并不完全令人满意。 首先,它们往往无法在近距离跟踪目标,因为它们基于视觉信号或需要机器人对全身进行观察。 其次,在目标外观变化的情况下,其目标再识别(重新识别)能力较弱,而且转移人的外观非常相似。为了消除对全体观察的假设,我们提议一个宽度跟踪模块,该模块依靠目标宽度,甚至可以近距离观测。在处理与外观变化有关的问题时,我们使用全球CNN(革命神经网络)描述仪代表目标,并使用山脊回归模型在网上学习目标外观模型。我们采用了一个用于在线分析学习的抽样战略,其中涉及长期和短期的样本。我们用两个数据集来评估我们的方法,包括一个遵循数据集的公众,一个定制的、一个具有挑战性外观和目标距离的定制的数据集。我们的方法在公共代码上都实现状态数据,我们的方法在公共代码上获取数据。