Can a computer determine a piano player's skill level? Is it preferable to base this assessment on visual analysis of the player's performance, or should we trust our ears over our eyes? Since current CNNs have difficulty processing long videos, how can shorter clips be sampled to best reflect the player's skill level? In this work, we collect and release a first-of-its-kind dataset for multimodal skill assessment focused on piano players' skill levels, answer the questions posed above, initiate work on automated evaluation of piano playing skills, and provide baselines for future work. The dataset is available at: https://github.com/ParitoshParmar/Piano-Skills-Assessment.