Soundata is a Python library for loading and working with audio datasets in a standardized way, removing the need for writing custom loaders in every project, and improving reproducibility by providing tools to validate data against a canonical version. It speeds up research pipelines by allowing users to quickly download a dataset, load it into memory in a standardized and reproducible way, validate that the dataset is complete and correct, and more. Soundata is based and inspired on mirdata and design to complement mirdata by working with environmental sound, bioacoustic and speech datasets, among others. Soundata was created to be easy to use, easy to contribute to, and to increase reproducibility and standardize usage of sound datasets in a flexible way.
翻译:Soundata是一个Python图书馆,用于以标准化的方式装载和操作音频数据集,消除每个项目对自定义装货机的需要,并通过提供工具,对照卡通版本验证数据,改善再复制性。它加速研究管道,使用户能够快速下载数据集,以标准化和可复制的方式将数据集装入记忆,验证数据集是完整和正确的,而且更多。Soundata以虚拟数据和设计为基础,并受到启发,通过与无害环境、生物声学和语音数据集等合作,补充虚拟数据。创造Soundata是为了便于使用、容易促进和更加灵活地复制和统一使用声音数据集。