Accent forms an integral part of identifying cultures, emotions,behavior's, etc. People often perceive each other in a different manner due to their accent. The accent itself can be a conveyor of status, pride, and other emotional information which can be captured through Speech itself. Accent itself can be defined as: "the way in which people in a particular area, country, or social group pronounce words" or "a special emphasis given to a syllable in a word, word in a sentence, or note in a set of musical notes". Accented Speech Recognition is one the most important problems in the domain of Speech Recognition. Speech recognition is an interdisciplinary sub-field of Computer Science and Linguistics research where the main aim is to develop technologies which enable conversion of speech into text. The speech can be of any form such as read speech or spontaneous speech, conversational speech. Speech unlike text has lot of diversity. This diversity stems from the environmental conditions, variabilities from speaker to speaker, channel noise, differences in Speech production due to disabilities, presence of disfluencies. Speech therefore is indeed a rich source of information waiting to be exploited.
翻译:口音本身可以是地位、自豪感和其他情感信息的传导器,可以通过语言本身获得。 口音本身可以被定义为:“特定地区、国家或社会群体的人如何表达语言”或“特别强调一个词、一个句子中的词或一组乐笔中的音符中的音符中的音调”。 口音识别是语音识别领域最重要的问题之一。 语音识别是计算机科学和语言研究的一个跨学科的次领域,其主要目的是开发能够将语音转换成文字的技术。 语言本身可以被定义为“特定地区、国家或社会群体的人如何表达语言”或“特别强调一个词、一个句子中的词句子或一组乐笔中的音符中的音调”。 这种多样性来自环境条件、演讲者之间的变异性、频道噪音、语言制作的差异、残疾的存在等。 因此,语音识别确实是有待利用的丰富信息源。