Sentiment analysis, also called opinion mining, is the field of study that analyzes people's opinions,sentiments, attitudes and emotions. Songs are important to sentiment analysis since the songs and mood are mutually dependent on each other. Based on the selected song it becomes easy to find the mood of the listener, in future it can be used for recommendation. The song lyric is a rich source of datasets containing words that are helpful in analysis and classification of sentiments generated from it. Now a days we observe a lot of inter-sentential and intra-sentential code-mixing in songs which has a varying impact on audience. To study this impact we created a Telugu songs dataset which contained both Telugu-English code-mixed and pure Telugu songs. In this paper, we classify the songs based on its arousal as exciting or non-exciting. We develop a language identification tool and introduce code-mixing features obtained from it as additional features. Our system with these additional features attains 4-5% accuracy greater than traditional approaches on our dataset.
翻译:感官分析,也称为“ 见解挖掘”,是分析人们意见、感官、态度和情感的研究领域。 歌曲对于情绪分析很重要,因为歌曲和情绪是相互依存的。 根据所选歌曲,很容易找到听众的情绪,将来可以用作建议。 歌词是丰富的数据集来源,包含有助于分析和分类从歌中产生的情感的词句。 现在,我们观察了在歌曲中大量流传和流传的代码混合,对观众有不同影响。 为了研究这一影响,我们创建了特卢古歌曲数据集,其中包括泰卢古-英语混合的和纯特卢古的歌曲。在本文中,我们根据歌曲的发音将歌曲分类为令人兴奋或不激动的歌曲。我们开发了一种语言识别工具,并引入了从中获取的代码混合特征,作为额外的特征。我们拥有这些特性的系统比我们数据集的传统方法更精准4-5%。