This paper presents a geometric approach to pitch estimation (PE)-an important problem in Music Information Retrieval (MIR), and a precursor to a variety of other problems in the field. Though there exist a number of highly-accurate methods, both mono-pitch estimation and multi-pitch estimation (particularly with unspecified polyphonic timbre) prove computationally and conceptually challenging. A number of current techniques, whilst incredibly effective, are not targeted towards eliciting the underlying mathematical structures that underpin the complex musical patterns exhibited by acoustic musical signals. Tackling the approach from both a theoretical and experimental perspective, we present a novel framework, a basis for further work in the area, and results that (whilst not state of the art) demonstrate relative efficacy. The framework presented in this paper opens up a completely new way to tackle PE problems, and may have uses both in traditional analytical approaches, as well as in the emerging machine learning (ML) methods that currently dominate the literature.
翻译:本文介绍了对音频信息检索(MIR)中一个重要问题的投影估测(PE)的几何方法,这是音乐信息检索(MIR)中一个重要的问题,也是该领域其他各种问题的前身。虽然存在一些高度精确的方法,但单切估测和多切估测(特别是未指明的多方音宽)在计算上和概念上都证明具有挑战性。一些现行技术虽然非常有效,但并非针对从理论和实验角度处理这一方法所展示的复杂音乐模式背后的数学结构。我们提出了一个新颖的框架,作为该领域进一步工作的基础,以及显示相对有效性的结果(尽管尚未达到最新水平 ) 。 本文提出的框架开启了解决 PE 问题的全新方法, 并可能在传统分析方法以及目前主导文献的新兴机器学习(ML)方法中使用了这些方法。