Human activity recognition and analysis has always been one of the most active areas of pattern recognition and machine intelligence, with applications in various fields, including but not limited to exertion games, surveillance, sports analytics and healthcare. Especially in Human-Robot Interaction, human activity understanding plays a crucial role as household robotic assistants are a trend of the near future. However, state-of-the-art infrastructures that can support complex machine intelligence tasks are not always available, and may not be for the average consumer, as robotic hardware is expensive. In this paper we propose a novel action sequence encoding scheme which efficiently transforms spatio-temporal action sequences into compact representations, using Mahalanobis distance-based shape features and the Radon transform. This representation can be used as input for a lightweight convolutional neural network. Experiments show that the proposed pipeline, when based on state-of-the-art human pose estimation techniques, can provide a robust end-to-end online action recognition scheme, deployable on hardware lacking extreme computing capabilities.
翻译:人类活动认识和分析始终是模式识别和机器智能最活跃的领域之一,其应用领域包括但不局限于应用游戏、监视、体育分析和保健。特别是在人类机器人互动中,人类活动理解发挥着关键作用,因为家庭机器人助理是近期的趋势。然而,并非总能提供最先进的基础设施,支持复杂的机器智能任务,而且可能不是普通消费者的,因为机器人硬件费用昂贵。在本文件中,我们提出了一个新的行动序列编码计划,利用马哈拉诺比斯远程形状特征和雷顿变形,将时空动作序列有效地转化为缩缩影。这一表述可以用作轻量的转动神经网络的投入。实验显示,拟议的管道,如果以最先进的人类构成估计技术为基础,可以提供一个强有力的端到端的在线行动识别机制,可以用于缺乏极端计算能力的硬件。