To date, various speech technology systems have adopted the vocoder approach, a method for synthesizing speech waveform that shows a major role in the performance of statistical parametric speech synthesis. WaveNet one of the best models that nearly resembles the human voice, has to generate a waveform in a time consuming sequential manner with an extremely complex structure of its neural networks.
翻译:迄今为止,各种语言技术系统都采用了电动电动电动法,这是一种合成语音波形的方法,在统计参数语音合成工作中发挥了主要作用。 WaveNet是接近于人类声音的最佳模型之一,它必须以耗时的顺序方式生成波形,其神经网络结构极其复杂。