Cadences are complex structures that have been driving music from the beginning of contrapuntal polyphony until today. Detecting such structures is vital for numerous MIR tasks such as musicological analysis, key detection, or music segmentation. However, automatic cadence detection remains challenging mainly because it involves a combination of high-level musical elements like harmony, voice leading, and rhythm. In this work, we present a graph representation of symbolic scores as an intermediate means to solve the cadence detection task. We approach cadence detection as an imbalanced node classification problem using a Graph Convolutional Network. We obtain results that are roughly on par with the state of the art, and we present a model capable of making predictions at multiple levels of granularity, from individual notes to beats, thanks to the fine-grained, note-by-note representation. Moreover, our experiments suggest that graph convolution can learn non-local features that assist in cadence detection, freeing us from the need of having to devise specialized features that encode non-local context. We argue that this general approach to modeling musical scores and classification tasks has a number of potential advantages, beyond the specific recognition task presented here.
翻译:Cadences是推动音乐的复杂结构,从相反的多元曲的开始到今天一直驱动着音乐。检测这种结构对于音乐学分析、关键检测或音乐分化等许多MIIR任务至关重要。然而,自动的candence 探测仍然具有挑战性,主要是因为它涉及诸如和谐、声音引导和节奏等高级音乐元素的组合。在这项工作中,我们用图示来表示象征性的分数,作为解决感应检测任务的中间手段。我们使用图象革命网络,将声调检测视为不平衡的节点分类问题。我们取得了与艺术状态大致相当的结果,我们展示了一种模型,能够从单调到节奏的多个层次作出预测。此外,我们的实验表明,图形演化可以学习有助于探测声调的非本地特征,使我们不必设计非本地背景的专用特征。我们说,这种模拟音乐分数和分数和分级任务的一般方法具有一定的优势。我们在这里提出一个具体认识任务。