A new impulse response (IR) dataset called "MeshRIR" is introduced. Currently available datasets usually include IRs at an array of microphones from several source positions under various room conditions, which are basically designed for evaluating speech enhancement and distant speech recognition methods. On the other hand, methods of estimating or controlling spatial sound fields have been extensively investigated in recent years; however, the current IR datasets are not applicable to validating and comparing these methods because of the low spatial resolution of measurement points. MeshRIR consists of IRs measured at positions obtained by finely discretizing a spatial region. Two subdatasets are currently available: one consists of IRs in a three-dimensional cuboidal region from a single source, and the other consists of IRs in a two-dimensional square region from an array of 32 sources. Therefore, MeshRIR is suitable for evaluating sound field analysis and synthesis methods. This dataset is freely available at \url{https://sh01k.github.io/MeshRIR/} with some codes of sample applications.
翻译:引入了新的脉冲反应数据集(IR),称为“MeshRIR”,目前可用的数据集通常包括来自不同房间条件下若干源位置的多个麦克风阵列的IRs,基本上设计用于评价语音增强和远程语音识别方法;另一方面,近年来广泛调查了估计或控制空间声域的方法;然而,由于测量点的空间分辨率低,目前的IR数据集不适用于验证和比较这些方法。MeshRIR由通过空间区域微分分化获得的位置测量的IRs组成。目前有两套子数据集:其中一套由单一源的三维立骨区域IRs组成,另一套由32个来源的二维平方区域IRs组成。因此,MeshrRIR不适合用于评价声音场分析和合成方法。该数据集可在以下https://sh01k.github.io/MeshRIR/}免费查阅,并有一些样本应用代码。