In the context of building acoustics and the acoustic diagnosis of an existing room, this paper introduces and investigates a new approach to estimate mean absorption coefficients solely from a room impulse response (RIR). This inverse problem is tackled via virtually-supervised learning, namely, the RIR-to-absorption mapping is implicitly learned by regression on a simulated dataset using artificial neural networks. We focus on simple models based on well-understood architectures. The critical choices of geometric, acoustic and simulation parameters used to train the models are extensively discussed and studied, while keeping in mind conditions that are representative of the field of building acoustics. Estimation errors from the learned neural models are compared to those obtained with classical formulas that require knowledge of the room's geometry and reverberation times. Extensive comparisons made on a variety of simulated test sets highlight different conditions under which the learned models can overcome the well-known limitations of the diffuse sound field hypothesis underlying these formulas. Results obtained on real RIRs measured in an acoustically configurable room show that at 1~kHz and above, the proposed approach performs comparably to classical models when reverberation times can be reliably estimated, and continues to work even when they cannot.
翻译:在建设声学和对现有房间进行声学分析的背景下,本文件介绍并调查了一种新办法,即仅从室冲动反应(RIR)中估计平均吸收系数。这一逆向问题通过几乎由监督的学习来解决,即通过人工神经网络在模拟数据集的回归中隐含地学习RIR到吸收映射图,我们注重基于深层建筑的简单模型;广泛讨论和研究用于培训模型的几何、声学和模拟参数的关键选择,同时牢记具有建立声学领域代表性的条件。从所学的神经模型中得出的估计误差与古典公式中获得的误差相比较,古典公式需要了解房间的几何和回校时间。对各种模拟测试组进行的广泛比较,突出各种条件,使所学模型能够克服这些公式所基于的分散的声场假设的众所周知的局限性。在声学可调可调的室内测量到的真RIR结果显示,即使在1~赫兹和以上,所学神经模型的拟议方法也无法持续可靠地进行时间到古典模型。