Human Activity Recognition (HAR) plays a vital role in healthcare, surveillance, and innovative environments, where reliable action recognition supports timely decision-making and automation. Although deep learning-based HAR systems are widely adopted, the impact of Activation Functions (AFs) and Model Optimizers (MOs) on performance has not been sufficiently analyzed, particularly regarding how their combinations influence model behavior in practical scenarios. Most existing studies focus on architecture design, while the interaction between AF and MO choices remains relatively unexplored. In this work, we investigate the effect of three commonly used activation functions (ReLU, Sigmoid, and Tanh) combined with four optimization algorithms (SGD, Adam, RMSprop, and Adagrad) using two recurrent deep learning architectures, namely BiLSTM and ConvLSTM. Experiments are conducted on six medically relevant activity classes selected from the HMDB51 and UCF101 datasets, considering their suitability for healthcare-oriented HAR applications. Our experimental results show that ConvLSTM consistently outperforms BiLSTM across both datasets. ConvLSTM, combined with Adam or RMSprop, achieves an accuracy of up to 99.00%, demonstrating strong spatio-temporal learning capabilities and stable performance. While BiLSTM performs reasonably well on UCF101, with accuracy approaching 98.00%, its performance drops to approximately 60.00% on HMDB51, indicating limited robustness across datasets and weaker sensitivity to AF and MO variations. This study provides practical insights for optimizing HAR systems, particularly for real-world healthcare environments where fast and precise activity detection is critical.
翻译:人体活动识别在医疗健康、监控安防及创新环境中扮演着关键角色,可靠的行为识别能够支持及时决策与自动化实现。尽管基于深度学习的HAR系统已被广泛采用,但激活函数与模型优化器对系统性能的影响尚未得到充分分析,尤其在实际场景中二者的组合如何影响模型行为仍不明确。现有研究多聚焦于架构设计,而AF与MO选择间的相互作用尚未得到深入探索。本研究采用两种循环深度学习架构(BiLSTM与ConvLSTM),系统探究三种常用激活函数(ReLU、Sigmoid与Tanh)与四种优化算法(SGD、Adam、RMSprop及Adagrad)的组合效应。实验基于从HMDB51和UCF101数据集中筛选的六类医疗相关活动展开,所选活动类别符合医疗健康导向的HAR应用需求。实验结果表明:ConvLSTM在两个数据集上均持续优于BiLSTM。ConvLSTM结合Adam或RMSprop优化器时,最高准确率达99.00%,展现出强大的时空学习能力与稳定性能。虽然BiLSTM在UCF101数据集上表现良好(准确率接近98.00%),但在HMDB51数据集上性能下降至约60.00%,表明其跨数据集鲁棒性有限,且对AF与MO变化的敏感性较弱。本研究为优化HAR系统提供了实践指导,尤其对实时性与精确性要求严苛的现实医疗场景具有重要参考价值。