This paper describes a novel Deep Learning method for the design of IIR parametric filters for automatic audio equalization. A simple and effective neural architecture, named BiasNet, is proposed to determine the IIR equalizer parameters. An output denormalization technique is used to obtain accurate tuning of the IIR filters center frequency, quality factor and gain. All layers involved in the proposed method are shown to be differentiable, allowing backpropagation to optimize the network weights and achieve, after a number of training iterations, the optimal output. The parameters are optimized with respect to a loss function based on a spectral distance between the measured and desired magnitude response, and a regularization term used to achieve a spatialization of the acoustc scene. Two scenarios with different characteristics were considered for the experimental evaluation: a room and a car cabin. The performance of the proposed method improves over the baseline techniques and achieves an almost flat band. Moreover IIR filters provide a consistently lower computational cost during runtime with respect to FIR filters.
翻译:本文描述了设计用于自动音量均匀的IR参数过滤器的新型深层学习方法。提议建立一个简单有效的神经结构,名为BiasNet,以决定IR平准参数。输出解正技术用于对IR过滤器中心频率、质量因子和收益进行精确的调整。拟议方法所涉及的所有层次都显示是不同的,允许后向转换,以优化网络重量,并在经过一些培训的迭代后达到最佳输出。根据测量和理想数量响应之间的光谱距离和用于实现焦距空间化的常规化术语,这些参数得到优化。实验评估考虑了两种具有不同特点的情景:一个房间和一个汽车舱。拟议方法的性能比基线技术有所改进,并达到一个几乎固定的波段。此外,IR过滤器在运行期间对FIR过滤器的计算成本始终较低。