Music is a mysterious language that conveys feeling and thoughts via different tones and timbre. For better understanding of timbre in music, we chose music data of 6 representative instruments, analysed their timbre features and classified them. Instead of the current trend of Neural Network for black-box classification, our project is based on a combination of MFCC and LPC, and augmented with a 6-dimensional feature vector designed by ourselves from observation and attempts. In our white-box model, we observed significant patterns of sound that distinguish different timbres, and discovered some connection between objective data and subjective senses. With a totally 32-dimensional feature vector and a naive all-pairs SVM, we achieved improved classification accuracy compared to a single tool. We also attempted to analyze music pieces downloaded from the Internet, found out different performance on different instruments, explored the reasons and suggested possible ways to improve the performance.
翻译:音乐是一种神秘的语言,通过不同的调子和调子传达感觉和思想。为了更好地了解音乐中的调子,我们选择了6个有代表性的乐器的音乐数据,分析了其色调特征并将其分类。我们的项目不是目前黑盒分类的神经网络趋势,而是以MFCC和LPC的组合为基础,并以我们从观察和尝试中设计出来的六维特质矢量作为补充。在我们的白盒模型中,我们观察到了区分不同色调的重要声音模式,发现了客观数据和主观感之间的某种联系。我们有了完全32维特质矢量和天真的全孔SVM,我们实现了与单一工具相比的分类准确性。我们还试图分析从互联网下载的音乐片,发现不同仪器的不同性能,探索了各种原因并提出改进性能的可能方法。