Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis

Measuring the acoustic characteristics of a space is often done by capturing its impulse response (IR), a representation of how a full-range stimulus sound excites it. This work generates an IR from a single image, which can then be applied to other signals using convolution, simulating the reverberant characteristics of the space shown in the image. Recording these IRs is both time-intensive and expensive, and often infeasible for inaccessible locations. We use an end-to-end neural network architecture to generate plausible audio impulse responses from single images of acoustic environments. We evaluate our method both by comparisons to ground truth data and by human expert evaluation. We demonstrate our approach by generating plausible impulse responses from diverse settings and formats including well-known places, musical halls, rooms in paintings, images from animations and computer games, synthetic environments generated from text, panoramic images, and video conference backgrounds.
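As a rough illustration of the convolution step mentioned above, the following minimal sketch applies an impulse response to a short "dry" signal. It does not reproduce the image-to-IR network: the IR here is a synthetic stand-in (exponentially decaying noise), and the helper names `synthetic_ir` and `convolve` are hypothetical, chosen only for this example.

```python
import math
import random


def synthetic_ir(length, decay, seed=0):
    """Toy impulse response (an assumption, not a model output):
    a unit direct-path spike followed by exponentially decaying noise."""
    rng = random.Random(seed)
    ir = [rng.uniform(-1.0, 1.0) * math.exp(-decay * n) for n in range(length)]
    ir[0] = 1.0  # direct sound arrives first at full amplitude
    return ir


def convolve(signal, ir):
    """Direct-form linear convolution.
    Output length is len(signal) + len(ir) - 1."""
    out = [0.0] * (len(signal) + len(ir) - 1)
    for i, s in enumerate(signal):
        if s == 0.0:
            continue  # silent samples contribute nothing
        for j, h in enumerate(ir):
            out[i + j] += s * h
    return out


# A toy "dry" signal: one click followed by a quieter click.
dry = [1.0, 0.0, 0.0, 0.5]
ir = synthetic_ir(length=8, decay=0.5)

# The "wet" signal carries the reverberant tail described by the IR.
wet = convolve(dry, ir)
```

Each input sample triggers a scaled copy of the IR, and the copies are summed, which is why the output is longer than the input by the length of the reverberant tail. For real audio at typical sample rates, an FFT-based convolution would normally replace this direct loop for speed.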

