Guitar tablature transcription is an important but understudied problem within the field of music information retrieval. Traditional signal processing approaches offer only limited performance on the task, and there is little acoustic data with transcription labels for training machine learning models. However, guitar transcription labels alone are more widely available in the form of tablature, which is commonly shared among guitarists online. In this work, a collection of symbolic tablature is leveraged to estimate the pairwise likelihood of notes on the guitar. The output layer of a baseline tablature transcription model is reformulated, such that an inhibition loss can be incorporated to discourage the co-activation of unlikely note pairs. This naturally enforces playability constraints for guitar, and yields tablature which is more consistent with the symbolic data used to estimate pairwise likelihoods. With this methodology, we show that symbolic tablature can be used to shape the distribution of a tablature transcription model's predictions, even when little acoustic data is available.
翻译:在音乐信息检索领域,吉他制表是一个重要的问题,但研究不足。传统信号处理方法只能提供有限的工作表现,而且用于培训机器学习模型的转录标签的声学数据很少。然而,吉他制表标签本身以图示形式更为广泛,吉他制表标签在网上通常由吉他手共享。在这项工作中,利用一组象征性制表材料来估计吉他笔记的配对可能性。重订了基准制表单转录模型的输出层,这样就可以吸收抑制性损失来阻止不太可能的音符组合的共动。这自然会强制实施吉他可播放性限制,并产生更加符合用来估计双向可能性的符号数据的表示特征。我们用这种方法表明,即使有微小的声学数据,也可以使用象征性制表符号制表符来决定标签制表模型预测的分布。