This paper presents an implementation on child activity recognition (CAR) with a graph convolution network (GCN) based deep learning model since prior implementations in this domain have been dominated by CNN, LSTM and other methods despite the superior performance of GCN. To the best of our knowledge, we are the first to use a GCN model in child activity recognition domain. In overcoming the challenges of having small size publicly available child action datasets, several learning methods such as feature extraction, fine-tuning and curriculum learning were implemented to improve the model performance. Inspired by the contradicting claims made on the use of transfer learning in CAR, we conducted a detailed implementation and analysis on transfer learning together with a study on negative transfer learning effect on CAR as it hasn't been addressed previously. As the principal contribution, we were able to develop a ST-GCN based CAR model which, despite the small size of the dataset, obtained around 50% accuracy on vanilla implementations. With feature extraction and fine-tuning methods, accuracy was improved by 20%-30% with the highest accuracy being 82.24%. Furthermore, the results provided on activity datasets empirically demonstrate that with careful selection of pre-train model datasets through methods such as curriculum learning could enhance the accuracy levels. Finally, we provide preliminary evidence on possible frame rate effect on the accuracy of CAR models, a direction future research can explore.
翻译:本文介绍了关于儿童活动识别(CAR)的执行情况,其基础是图象混凝土网络(GCN),其基础是深层次学习模式,因为尽管有CNN、LSTM和其他方法的优异表现,但先前在这一领域的实施一直以CNN、LSTM和其他方法为主。据我们所知,我们首先在儿童活动识别领域使用GCN模式。在克服小规模公开儿童行动数据集的挑战方面,实施了地物提取、微调和课程学习等几种学习方法,以改进模型性能。在对在CAR使用转让学习的相互矛盾的说法的启发下,我们开展了关于转让学习的详细实施和分析,同时开展了关于对CAR的负面转移学习效果的研究。作为主要贡献,我们开发了基于ST-GCN的CAR模型,尽管数据集规模小,在香草实施方面获得了大约50%的准确性。通过特征提取和微调方法,精确性能提高了20%-30%,最高精确性能为82.24 %。此外,我们提供的关于活动数据采集结果,实验性地展示了对CAR的消极性学习效果,最后通过初步证据选择方法,可以提高未来数据库的精确度。