Incorporating a vocal persona into synthesized voice can be significant in a broad range of human-computer interaction (HCI) applications, including augmentative and assistive communication (AAC), artistic performance, and the design of virtual agents. We propose a framework for imbuing a synthesized voice with compelling, context-dependent expression by introducing the role of the vocal persona into the synthesis system. In this framework, the resultant 'tone of voice' is defined as a point in a continuous, context-dependent probability space that the user of the voice can traverse. We also present initial findings from a thematic analysis of 10 interviews with experts in vocal studies and performance, conducted to better understand the role of the vocal persona within a natural communication ecology. The themes identified are then used to inform the design of the framework.