The development of pathological speech systems is currently hindered by the lack of a standardised objective evaluation framework. In this work, (1) we use existing detection and analysis techniques to propose a general framework for the consistent evaluation of synthetic pathological speech. The framework evaluates both the voice quality and the intelligibility of speech, and our experiments show that the two evaluations are complementary. (2) Using the proposed evaluation framework, we develop and test a dysarthric voice conversion (VC) system that combines CycleGAN-VC with a PSOLA-based speech rate modification technique. We show that the developed system can synthesise dysarthric speech with different levels of speech intelligibility.