Recent studies in Human Activity Recognition (HAR) have shown that Deep Learning methods are able to outperform classical Machine Learning algorithms. One popular Deep Learning architecture in HAR is the DeepConvLSTM. In this paper we propose to alter the DeepConvLSTM architecture to employ a 1-layered instead of a 2-layered LSTM. We validate our architecture change on 5 publicly available HAR datasets by comparing the predictive performance with and without the change employing varying hidden units within the LSTM layer(s). Results show that across all datasets, our architecture consistently improves on the original one: Recognition performance increases up to 11.7% for the F1-score, and our architecture significantly decreases the amount of learnable parameters. This improvement over DeepConvLSTM decreases training time by as much as 48%. Our results stand in contrast to the belief that one needs at least a 2-layered LSTM when dealing with sequential data. Based on our results we argue that said claim might not be applicable to sensor-based HAR.
翻译:人类活动识别(HAR)的最新研究表明,深学习方法能够超越经典机器学习算法。HAR中流行的深学习结构之一是深ConvLSTM。在本文中,我们提议修改深ConvLSTM结构,以使用一层而不是二层LSTM结构。我们通过比较LSTM层内使用不同隐藏单位的预测性能和不使用不同变化来验证5个公开提供的HAR数据集的结构变化。结果显示,在所有数据集中,我们的架构不断改进:F1核心的识别性能提高到11.7%,而我们的架构大幅降低可学习参数的数量。深海ConvLSTM的这一改进将培训时间减少48%。我们的结果与以下信念形成对比:在处理连续数据时,至少需要2层LSTM。基于我们的结果,我们认为,上述主张可能不适用于基于传感器的HAR。