Many speech and music analysis and processing schemes rely on an estimate of the fundamental frequency $f_0$ of periodic signal components. Most established schemes apply rather unspecific signal models such as sinusoidal models to the estimation problem, which may limit time resolution and estimation accuracy. This study proposes a novel time-domain locked-loop algorithm with low computational effort and low memory footprint for $f_0$ estimation. The loop control signal is directly derived from the input time signal, using a harmonic signal model. Theoretically, this allows for a noise-robust and rapid $f_0$ estimation for periodic signals of arbitrary waveform, and without the requirement of a prior frequency analysis. Several simulations with short signals employing different types of periodicity and with added wide-band noise were performed to demonstrate and evaluate the basic properties of the proposed algorithm. Depending on the Signal-to-Noise Ratio (SNR), the estimator was found to converge within 3-4 signal repetitions, even at SNR close to or below 0dB. Furthermore, it was found to follow fundamental frequency sweeps with a delay of less than one period and to track all tones of a three-tone musical chord signal simultaneously. Quasi-periodic sounds with shifted harmonics as well as signals with stochastic periodicity were robustly tracked. Mean and standard deviation of the estimation error, i.e., the difference between true and estimated $f_0$, were at or below 1 Hz in most cases. The results suggest that the proposed algorithm may be applicable to low-delay speech and music analysis and processing.
翻译:许多言语和音乐分析及处理计划依赖于对定期信号组件基本频率的估计数,即0.0美元。大多数既定计划都对估算问题采用相当不具体的信号模型,如类流模型,这可能会限制时间分辨率和估计准确性。本研究提出了一个新的时间-域锁定环流算法,其计算努力低,记忆足迹低,估计值为0.0美元。循环控制信号直接来自输入时间信号,使用一个和谐信号模型。理论上,这允许对任意波形的定期信号进行噪音-紫色和快速的0.0美元估算,而无需事先的频率分析。进行了一些短信号模拟,使用不同周期和增加宽频度的信号,以显示和评估拟议算法的基本性质。根据信号到噪音比率(SNR),估计信号在3-4信号重复中,即使是接近或低于0dB。此外,还发现它跟踪基本频率扫荡的频率,延迟不到1个时间段或低于1度的语音波状信号,在3度和1度的轨道上显示最稳度和最稳度的信号,在1度的轨道上,在1度和最稳度的轨道上显示,在1度和最稳度之间,在1度和最稳度的轨道上的测距之间,估计是稳定的信号和最精确的频率分析。