使用对部分缺失通道进行多通道观测的声频场分类 (Acoustic Scene Classification Using Multichannel Observation with Partially Missing Channels) - 专知论文

会员服务 ·

0

Performer · 通道 · 数据增强 · INFORMS · SimPLe ·

2021 年 5 月 5 日

Acoustic Scene Classification Using Multichannel Observation with Partially Missing Channels

翻译：使用对部分缺失通道进行多通道观测的声频场分类

from arxiv, Accepted to EUSIPCO2021

Sounds recorded with smartphones or IoT devices often have partially unreliable observations caused by clipping, wind noise, and completely missing parts due to microphone failure and packet loss in data transmission over the network. In this paper, we investigate the impact of the partially missing channels on the performance of acoustic scene classification using multichannel audio recordings, especially for a distributed microphone array. Missing observations cause not only losses of time-frequency and spatial information on sound sources but also a mismatch between a trained model and evaluation data. We thus investigate how a missing channel affects the performance of acoustic scene classification in detail. We also propose simple data augmentation methods for scene classification using multichannel observations with partially missing channels and evaluate the scene classification performance using the data augmentation methods.

翻译：在本文中,我们调查部分缺失的频道对使用多声道录音进行声学现场分类的效果的影响,特别是对分布式麦克风阵列的影响。缺失的观测不仅造成音频和空间信息损失,而且造成经过训练的模型和评估数据之间的不匹配。我们因此调查缺少的频道如何影响声学现场分类的详细性能。我们还提议使用部分缺失的多声道观测进行现场分类的简单数据增强方法,并利用数据扩增方法评估现场分类的性能。

0

相关内容

Performer

移动数字广告与互联网反欺诈蓝皮报告

移动数字广告与互联网反欺诈蓝皮报告

专知会员服务

28+阅读 · 2021年5月13日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

专知会员服务

46+阅读 · 2020年7月29日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

【论文推荐】Short Text Classiﬁcation via Term Graph 基于术语图的短文本分类

【论文推荐】Short Text Classiﬁcation via Term Graph 基于术语图的短文本分类

专知会员服务

20+阅读 · 2020年1月20日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

专知会员服务

5+阅读 · 2019年12月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

已删除

将门创投

7+阅读 · 2017年7月11日

Bayesian Spanning Tree: Estimating the Backbone of the Dependence Graph

Arxiv

0+阅读 · 2021年6月30日

Effect of acoustic scene complexity and visual scene representation on auditory perception in virtual audio-visual environments

Arxiv

0+阅读 · 2021年6月30日

Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm

Arxiv

0+阅读 · 2021年6月28日

Modelling High-Dimensional Categorical Data Using Nonconvex Fusion Penalties

Arxiv

0+阅读 · 2021年6月28日

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Arxiv

0+阅读 · 2021年6月28日

Change-Point Detection in Dynamic Networks with Missing Links

Arxiv

0+阅读 · 2021年6月28日

Improving Grasp Planning Efficiency with Human Grasp Tendencies*

Arxiv

0+阅读 · 2021年6月27日

Multi-Source Neural Machine Translation with Missing Data

Arxiv

5+阅读 · 2018年6月7日

Topic Modelling of Empirical Text Corpora: Validity, Reliability, and Reproducibility in Comparison to Semantic Maps

Arxiv

4+阅读 · 2018年6月4日

MR image reconstruction using deep density priors

Arxiv

5+阅读 · 2018年1月17日

VIP会员

文章信息

相关主题

相关VIP内容

移动数字广告与互联网反欺诈蓝皮报告

移动数字广告与互联网反欺诈蓝皮报告

专知会员服务

28+阅读 · 2021年5月13日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

图像分类半监督自监督无监督学习综述，A survey on Semi-, Self- and Unsupervised Learning for Image Classification

专知会员服务

46+阅读 · 2020年7月29日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

【论文推荐】Short Text Classiﬁcation via Term Graph 基于术语图的短文本分类

【论文推荐】Short Text Classiﬁcation via Term Graph 基于术语图的短文本分类

专知会员服务

20+阅读 · 2020年1月20日

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

【论文】深度卷积神经网络的ImageNet分类（ImageNet Classification with Deep Convolutional Neural Networks）

专知会员服务

14+阅读 · 2020年1月1日

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

【ECML-PKDD 2019】带歧义的分类变量编码（Encoding Categorical Variables with Ambiguity）

专知会员服务

5+阅读 · 2019年12月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS 2025】稳定电影度量：面向专业视频生成的结构化分类与评测体系

战场AI决策支持系统

【博士论文】面向排序与扩散模型的安全、高效与鲁棒强化学习

面向 AI 生成图像的安全与鲁棒水印：全面综述

相关资讯

已删除

将门创投

7+阅读 · 2017年7月11日

相关论文

Bayesian Spanning Tree: Estimating the Backbone of the Dependence Graph

Arxiv

0+阅读 · 2021年6月30日

Effect of acoustic scene complexity and visual scene representation on auditory perception in virtual audio-visual environments

Arxiv

0+阅读 · 2021年6月30日

Inductive Bias of Multi-Channel Linear Convolutional Networks with Bounded Weight Norm

Arxiv

0+阅读 · 2021年6月28日

Modelling High-Dimensional Categorical Data Using Nonconvex Fusion Penalties

Arxiv

0+阅读 · 2021年6月28日

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Arxiv

0+阅读 · 2021年6月28日

Change-Point Detection in Dynamic Networks with Missing Links

Arxiv

0+阅读 · 2021年6月28日

Improving Grasp Planning Efficiency with Human Grasp Tendencies*

Arxiv

0+阅读 · 2021年6月27日

Multi-Source Neural Machine Translation with Missing Data

Arxiv

5+阅读 · 2018年6月7日

Topic Modelling of Empirical Text Corpora: Validity, Reliability, and Reproducibility in Comparison to Semantic Maps

Arxiv

4+阅读 · 2018年6月4日

MR image reconstruction using deep density priors

Arxiv

5+阅读 · 2018年1月17日

微信扫码咨询专知VIP会员