The approach used not only challenges some of the fundamental mathematical techniques used so far in early experiments of the same trend but also introduces new scopes and new horizons for interesting results. The physics governing spectrograms have been optimized in the project along with exploring how it handles the intense requirements of the problem at hand. Major contributions and developments brought under the light, through this project involve using better mathematical techniques and problem-specific machine learning methods. Improvised data analysis and data augmentation for audio datasets like frequency masking and random frequency-time stretching are used in the project and hence are explained in this paper. In the used methodology, the audio transforms principle were also tried and explored, and indeed the insights gained were used constructively in the later stages of the project. Using a deep learning principle is surely one of them. Also, in this paper, the potential scopes and upcoming research openings in both short and long term tunnel of time has been presented. Although much of the results gained are domain-specific as of now, they are surely potent enough to produce novel solutions in various different domains of diverse backgrounds.
翻译:这一方法不仅挑战了早期试验中迄今在相同趋势中使用的一些基本数学技术,而且还为有趣的结果引入了新的范围和新视野。该项目优化了关于光谱图的物理学,同时探索如何处理手头问题的大量要求。通过该项目带来的重大贡献和发展,涉及使用更好的数学技术和针对具体问题的机器学习方法。该项目使用了对频率掩码和随机频率拉长等音频数据集的简易数据分析和数据增强,因此本文件对此作了解释。在使用的方法中,还尝试和探索了音频转换原则,实际上在项目的后期阶段建设性地利用了所获得的见解。使用深层学习原则肯定是其中之一。此外,本文还介绍了短期和长远的隧道的潜在范围和即将开始的研究。虽然目前取得的结果大部分是特定领域,但它们肯定足以在不同背景的不同领域产生新的解决办法。