Previous research has found that voices can provide reliable information for gender classification with a high level of accuracy. In social psychology, perceived vocal masculinity and femininity has often been considered as an important feature on social behaviours. While previous studies have characterised acoustic features that contributed to perceivers' judgements of speakers' vocal masculinity or femininity, there is limited research on building an objective masculinity/femininity scoring model and characterizing the independent acoustic factors that contribute to the judgements of speakers' vocal masculinity or femininity. In this work, we firstly propose an objective masculinity/femininity scoring system based on the Extreme Random Forest and then characterize the independent and meaningful acoustic factors contributing to perceivers' judgements by using a correlation matrix based hierarchical clustering method. The results show the objective masculinity/femininity ratings strongly correlated with the perceived masculinity/femininity ratings when we used an optimal speech duration of 7 seconds, with a correlation coefficient of up to .63 for females and .77 for males. 9 independent clusters of acoustic measures were generated from our modelling of femininity judgements for female voices and 8 clusters were found for masculinity judgements for male voices. The results revealed that, for both sexes, the F0 mean is the most critical acoustic measure affects the judgement of vocal masculinity and femininity. The F3 mean, F4 mean and VTL estimators are found to be highly inter-correlated and appeared in the same cluster, forming the second significant factor. Next, F1 mean, F2 mean and F0 standard deviation are independent factors that share similar importance. The voice perturbation measures, including HNR, jitter and shimmer, are of lesser importance.
翻译:先前的研究发现,声音能够以高度准确性为性别分类提供可靠的信息。在社会心理学中,人们所认为的男性性和女性性通常被视为社会行为的一个重要特征。虽然以前的研究具有声学特征,有助于人们判断演讲者声音男性性或女性性,但对于建立客观的男性性/女性性评分模式和描述独立声学因素,有助于判断演讲者声音男性性或女性性的判断。在社会心理学中,我们首先建议以极端随机森林为基础建立客观的男性性/女性性格和女性性格评分系统,然后通过使用基于等级组合法的关联矩阵,来描述有助于人们判断者判断的独立和有意义的声学因素。 研究结果显示,在使用最优发言时间为7秒钟时,雄性/女性性评分的评分与女性之间的评分非常不相近,女性和男性之间的相对性评分值高达.63,男性的相对性评分值为.77。 女性的判断力和男性性判分数的9个独立组结果显示,包括女性性判分数的分数为女性性。