In modern society, people should not be identified based on their disability, rather, it is environments that can disable people with impairments. Improvements to automatic Sign Language Recognition (SLR) will lead to more enabling environments via digital technology. Many state-of-the-art approaches to SLR focus on the classification of static hand gestures, but communication is a temporal activity, which is reflected by many of the dynamic gestures present. Given this, temporal information during the delivery of a gesture is not often considered within SLR. The experiments in this work consider the problem of SL gesture recognition regarding how dynamic gestures change during their delivery, and this study aims to explore how single types of features as well as mixed features affect the classification ability of a machine learning model. 18 common gestures recorded via a Leap Motion Controller sensor provide a complex classification problem. Two sets of features are extracted from a 0.6 second time window, statistical descriptors and spatio-temporal attributes. Features from each set are compared by their ANOVA F-Scores and p-values, arranged into bins grown by 10 features per step to a limit of the 250 highest-ranked features. Results show that the best statistical model selected 240 features and scored 85.96% accuracy, the best spatio-temporal model selected 230 features and scored 80.98%, and the best mixed-feature model selected 240 features from each set leading to a classification accuracy of 86.75%. When all three sets of results are compared (146 individual machine learning models), the overall distribution shows that the minimum results are increased when inputs are any number of mixed features compared to any number of either of the two single sets of features.
翻译:在现代社会中,不应根据残疾来识别人,相反,这是能够使残疾人残疾的环境。改进自动手语识别(SLR)将通过数字技术导致更有利的环境。许多最先进的SLR方法侧重于静态手势的分类,但通信是一种时间活动,许多动态手势都反映了这一点。鉴于这一点,在SLR中通常不考虑作出姿态期间的时间信息。在这项工作中,实验考虑了SL手势识别问题,即其执行期间动态动作变化如何,改进SL手势识别(SLR)将通过数字技术导致更有利的环境。改进自动手语识别(SLRRR)将带来更有利的环境。许多最先进的SLLRL方法将侧重于静态手势动作的分类,但通信是一种复杂的分类问题。有两组特征,即:SLOVA F-S-Streative(Sandlement ANOVA F-STR) 和 pvalive(PL)) 。本实验中,每个组合动作动作动作动作动作动作识别器数是每个步骤的10个特点,相对于机器学习的精确度为250的精确度,而最精度为每组,最精度为每组,最精度的精度为每组的精度为每组,最精度为每组的精度为每组。选举的精度的精度为每组。选举的精度为每组。