The task of efficient automatic music classification is of vital importance and forms the basis for various advanced applications of AI in the musical domain. Musical instrument recognition is the task of instrument identification by virtue of its audio. This audio, also termed as the sound vibrations are leveraged by the model to match with the instrument classes. In this paper, we use an artificial neural network (ANN) model that was trained to perform classification on twenty different classes of musical instruments. Here we use use only the mel-frequency cepstral coefficients (MFCCs) of the audio data. Our proposed model trains on the full London philharmonic orchestra dataset which contains twenty classes of instruments belonging to the four families viz. woodwinds, brass, percussion, and strings. Based on experimental results our model achieves state-of-the-art accuracy on the same.
翻译:高效自动音乐分类的任务至关重要,是AI在音乐领域各种先进应用的基础。音乐仪器的识别是借助其音频进行仪器识别的任务。这种音频(也称为声音振动)被模型与仪器类别相匹配。在本文中,我们使用一个人工神经网络(ANN)模型,该模型经过培训,可以对20种不同的乐器进行分类。在这里,我们只使用音频数据中流频中流系数(MFCCs)。我们提议的伦敦交响乐乐团全数据集模型包含属于四个家庭(木风、黄铜、冲击和弦等)的20类仪器。根据实验结果,我们模型实现了相同的最新精确度。