This paper proposes a new procedure to detect Glottal Closure and Opening Instants (GCIs and GOIs) directly from speech waveforms. The procedure is divided into two successive steps. First a mean-based signal is computed, and intervals where speech events are expected to occur are extracted from it. Secondly, at each interval a precise position of the speech event is assigned by locating a discontinuity in the Linear Prediction residual. The proposed method is compared to the DYPSA algorithm on the CMU ARCTIC database. A significant improvement as well as a better noise robustness are reported. Besides, results of GOI identification accuracy are promising for the glottal source characterization.
翻译:本文件建议采用新的程序,直接从语音波形中探测Glottal 关闭和开关装置(GCIs和GOIs),该程序分为两个连续步骤。首先,计算平均信号,从中抽取预期发生演讲事件的间隔。其次,通过在线性预测残留物中定位不连续的间隔,确定演讲活动的确切位置。提议的方法与CMU ARCTIC数据库中的DYPSA算法进行比较。报告有重大改进,并报告噪音稳健性更好。此外,GOI的识别准确性对Glottal 源的特性很有希望。