Human action recognition in computer vision has been widely studied in recent years. However, most algorithms consider only certain action specially with even high computational cost. That is not suitable for practical applications with multiple actions to be identified with low computational cost. To meet various application scenarios, this paper presents a unified human action recognition framework composed of two modules, i.e., multi-form human detection and corresponding action classification. Among them, an open-source dataset is constructed to train a multi-form human detection model that distinguishes a human being's whole body, upper body or part body, and the followed action classification model is adopted to recognize such action as falling, sleeping or on-duty, etc. Some experimental results show that the unified framework is effective for various application scenarios. It is expected to be a new application-driven AI paradigm for human action recognition.
翻译:近年来,对计算机愿景中的人类行动认识进行了广泛研究,但大多数算法仅考虑某些特别具有甚至高计算成本的行动,这不适合以低计算成本确定多种行动的实际应用。为了应对各种应用设想,本文件提出了一个由两个模块组成的人类行动认识统一框架,即多式人类探测和相应的行动分类,其中,构建了一个开放源数据集,以培养一种多式人类检测模型,区分一个人的整个身体、上体或身体部分体,并采用以下行动分类模型,以确认诸如坠落、睡眠或值班等行动。一些实验结果显示,统一框架对各种应用设想是有效的。预计这是一个以应用驱动的新的人类行动识别AI模式。