GWA: A Large High-Quality Acoustic Dataset for Audio Processing

We present the Geometric-Wave Acoustic (GWA) dataset, a large-scale audio dataset of about 2 million synthetic room impulse responses (IRs) together with their detailed geometric and simulation configurations. The dataset samples acoustic environments from over 6.8K high-quality, diverse, professionally designed houses represented as semantically labeled 3D meshes. We also present a novel scheme for assigning real-world acoustic materials, based on semantic matching with a sentence transformer model. We compute high-quality impulse responses that capture both low-frequency and high-frequency wave effects accurately by automatically calibrating geometric acoustic ray-tracing with a finite-difference time-domain wave solver. We demonstrate the higher accuracy of our IRs by comparing them with recorded IRs from complex real-world environments. Moreover, we highlight the benefits of GWA on audio deep learning tasks such as automatic speech recognition, speech enhancement, and speech separation. This is the first dataset with accurate wave acoustic simulations in complex scenes. Code and data are available at https://gamma.umd.edu/pro/sound/gwa.
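As a rough illustration of how a room impulse response is typically used in the speech tasks mentioned above, the sketch below convolves a dry signal with an IR to produce a reverberant one. It is not taken from the GWA code release: the function names `synthetic_ir` and `reverberate` are hypothetical, and the IR here is a toy stand-in (exponentially decaying noise) rather than one simulated by the dataset's geometric-wave pipeline.

```python
import numpy as np


def synthetic_ir(sample_rate=16000, rt60=0.5, length_s=1.0, seed=0):
    """Toy IR (an assumption, not a GWA IR): decaying white noise.

    The amplitude envelope drops by 60 dB after `rt60` seconds.
    """
    rng = np.random.default_rng(seed)
    n = int(sample_rate * length_s)
    t = np.arange(n) / sample_rate
    envelope = 10.0 ** (-3.0 * t / rt60)  # 10**-3 in amplitude equals -60 dB
    ir = rng.standard_normal(n) * envelope
    ir[0] = 1.0  # crude direct-path impulse
    return ir / np.max(np.abs(ir))


def reverberate(dry, ir):
    """Apply an IR to a dry signal by linear convolution, then peak-normalize."""
    wet = np.convolve(dry, ir)  # output length is len(dry) + len(ir) - 1
    peak = np.max(np.abs(wet))
    return wet / peak if peak > 0 else wet


# Usage: one second of a 440 Hz tone as a placeholder for dry speech.
sr = 16000
dry = np.sin(2 * np.pi * 440 * np.arange(sr) / sr)
wet = reverberate(dry, synthetic_ir(sample_rate=sr))
```

In practice an IR loaded from the dataset would replace `synthetic_ir`, and for long signals an FFT-based convolution is usually faster than `np.convolve`.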

