Harsh vocal effects such as screams or growls are far more common in heavy metal vocals than the traditionally sung vocal. This paper explores the problem of detection and classification of extreme vocal techniques in heavy metal music, specifically the identification of different scream techniques. We investigate the suitability of various feature representations, including cepstral, spectral, and temporal features as input representations for classification. The main contributions of this work are (i) a manually annotated dataset comprised of over 280 minutes of heavy metal songs of various genres with a statistical analysis of occurrences of different extreme vocal techniques in heavy metal music, and (ii) a systematic study of different input feature representations for the classification of heavy metal vocals
翻译:本文探讨重金属音乐中极端声音技术的探测和分类问题,特别是确定不同的尖叫技术。我们调查各种地物表现的适宜性,包括 ⁇ 、光谱和时间特征,作为分类的输入表示。 这项工作的主要贡献是:(一) 一个人工附加说明的数据集,由各族类重金属歌曲280多分钟组成,对重金属音乐中不同极端声音技术的发生情况进行统计分析;(二) 系统研究用于重金属音分类的不同输入特征表现。