Melody estimation or melody extraction refers to the extraction of the primary or fundamental dominant frequency in a melody. This sequence of frequencies obtained represents the pitch of the dominant melodic line from recorded music audio signals. The music signal may be monophonic or polyphonic. The melody extraction problem from audio signals gets complicated when we start dealing with polyphonic audio data. This is because in generalized audio signals,the sounds are highly correlated over both frequency and time domains. This complex overlap of many sounds, makes identification of predominant frequency challenging.
翻译:Melody 估计或旋律提取是指在旋律中提取主要或基本主要频率。 获得的频率序列代表了音乐音频信号中占主导地位的旋律线的定位。 音乐信号可以是单声波或多声波。 当我们开始处理多音音频数据时, 音频信号中的旋律提取问题会变得复杂。 这是因为在一般的音频信号中, 声音在频率和时间域上都高度相关。 许多声音的复杂重叠使得主要频率的识别具有挑战性。