In recent years, the task of Automatic Music Transcription (AMT), whereby various attributes of music notes are estimated from audio, has received increasing attention. At the same time, the related task of Multi-Pitch Estimation (MPE) remains a challenging but necessary component of almost all AMT approaches, even if only implicitly. In the context of AMT, pitch information is typically quantized to the nominal pitches of the Western music scale. Even in more general contexts, MPE systems typically produce pitch predictions with some degree of quantization. In certain applications of AMT, such as Guitar Tablature Transcription (GTT), it is more meaningful to estimate continuous-valued pitch contours. Guitar tablature has the capacity to represent various playing techniques, some of which involve pitch modulation. Contemporary approaches to AMT do not adequately address pitch modulation, and offer only less quantization at the expense of more model complexity. In this paper, we present a GTT formulation that estimates continuous-valued pitch contours, grouping them according to their string and fret of origin. We demonstrate that for this task, the proposed method significantly improves the resolution of MPE and simultaneously yields tablature estimation results competitive with baseline models.
翻译:近年来,自动音乐分流(AMT)的任务(自动音乐分流(自动音乐分流)的任务(自动音乐分流的不同属性是从音频中估算的)日益受到重视,同时,多盘曲估测(MPE)的相关任务(多盘曲估测(MPE))仍然是几乎所有自动调控方法中具有挑战性但必要的组成部分,即使只是隐含的。在自动调控中,投球信息通常与西方音乐规模的名义音调量成四分化。即使在更一般的情况下,MPE系统通常会产生某种程度的分化的投影预测。在诸如吉塔调控调(GTTT)等自动调控调的某些应用中,估算连续价值的投影轮廓(GTT)更有意义。Guitar制表(MPE)具有代表各种游戏技术的能力,其中一些涉及投影调。现代投影仪方法并不充分处理投影调问题,而仅提供较少的四分化,而以更复杂的模式为代价。我们介绍了一种GTTT的公式,即估计连续估价的轮廓轮轮,按其弦和胎轮廓排列,按其轮廓进行。我们同时展示了计算,并同时改进了计算了计算结果。