Deaf individuals confront significant communication obstacles on a daily basis. Their inability to hear makes it difficult for them to communicate with those who do not understand sign language. Moreover, it presents difficulties in educational, occupational, and social contexts. By providing alternative communication channels, technology can play a crucial role in overcoming these obstacles. One such technology that can facilitate communication between deaf and hearing individuals is sign language recognition. We will create a robust system for sign language recognition in order to convert Indian Sign Language to text or speech. We will evaluate the proposed system and compare CNN and LSTM models. Since there are both static and gesture sign languages, a robust model is required to distinguish between them. In this study, we discovered that a CNN model captures letters and characters for recognition of static sign language better than an LSTM model, but it outperforms CNN by monitoring hands, faces, and pose in gesture sign language phrases and sentences. The creation of a text-to-sign language paradigm is essential since it will enhance the sign language-dependent deaf and hard-of-hearing population's communication skills. Even though the sign-to-text translation is just one side of communication, not all deaf or hard-of-hearing people are proficient in reading or writing text. Some may have difficulty comprehending written language due to educational or literacy issues. Therefore, a text-to-sign language paradigm would allow them to comprehend text-based information and participate in a variety of social, educational, and professional settings. Keywords: deaf and hard-of-hearing, DHH, Indian sign language, CNN, LSTM, static and gesture sign languages, text-to-sign language model, MediaPipe Holistic, sign language recognition, SLR, SLT
翻译:聋哑人士每天都面对着重重的沟通障碍。他们的听力障碍使得与不懂手语的人沟通变得困难。此外,在教育、职业和社交等方面也会出现困难。通过提供替代沟通渠道,技术可以在克服这些障碍方面起到关键作用。一种可以促进聋人和听力正常的人之间沟通的技术是手语识别。我们将创建一个强大的手语识别系统,将印度手语转换为文本或语音。我们将评估所提出的系统,并比较CNN和LSTM模型。由于静态和手势手语都存在,因此需要一个强大的模型来区分它们。在本研究中,我们发现CNN模型更好地捕获静态手语的字母和字符以进行识别,但在手势手语短语和句子中通过监控手、脸和姿势优于CNN。创建文本到手语的范例是必要的,因为它将增强依赖手语的聋哑人群的沟通技巧。虽然手语到文本的翻译只是沟通的一方面,但并不是所有聋哑人都擅长阅读或写作。有些人可能因教育或文化程度不高而难以理解书面语言。因此,文本到手语的范例将使他们能够理解基于文本的信息,并参与各种社交、教育和专业场合。关键词:聋哑人,DHH,印度手语,CNN,LSTM,静态和手势手语,文本到手语模型,MediaPipe Holistic,手语识别,SLR,SLT