Piano fingering -- knowing which finger to use to play each note in a musical piece, is a hard and important skill to master when learning to play the piano. While some sheet music is available with expert-annotated fingering information, most pieces lack this information, and people often resort to learning the fingering from demonstrations in online videos. We consider the AI task of automating the extraction of fingering information from videos. This is a non-trivial task as fingers are often occluded by other fingers, and it is often not clear from the video which of the keys were pressed, requiring the synchronization of hand position information and knowledge about the notes that were played. We show how to perform this task with high-accuracy using a combination of deep-learning modules, including a GAN-based approach for fine-tuning on out-of-domain data. We extract the fingering information with an f1 score of 97\%. We run the resulting system on 90 videos, resulting in high-quality piano fingering information of 150K notes, the largest available dataset of piano-fingering to date.
翻译:钢琴指法 -- -- 知道用哪根手指来弹奏音乐片中的每个音符,是学习弹钢琴时掌握的艰难而重要的技能。虽然有些乐谱有专家附加的指法信息,但大多数乐谱缺乏这种信息,而且人们经常在在线视频中从演示中学习指法。我们认为AI的任务是将从视频中提取的指法信息自动化。这是一个非三角任务,因为手指往往被其他手指所遮住,而且从视频中往往不清楚按键的哪个键,这需要手持位置信息的同步和对所播放的笔记的了解。我们展示了如何使用深层学习模块(包括基于GAN的对外部数据进行微调的GAN方法)来以高精度执行这项任务。我们用97<unk> 的F1分提取指法信息。我们在90个视频上运行由此产生的系统,导致150K音调高品质的钢琴指针信息,这是迄今为止最大的钢琴挥动数据集。</s>