空间制图 (Audio Latent Space Cartography) - 专知论文

会员服务 ·

0

潜在 · WEB · 数据集 · 语音学 · 计算学习理论 ·

2022 年 12 月 7 日

Audio Latent Space Cartography

翻译：空间制图

Nicolas Jonason,Bob L. T. Sturm

from arxiv, Late Breaking / Demo, ISMIR 2022 (https://ismir2022program.ismir.net/lbd_413.html)

We explore the generation of visualisations of audio latent spaces using an audio-to-image generation pipeline. We believe this can help with the interpretability of audio latent spaces. We demonstrate a variety of results on the NSynth dataset. A web demo is available.

翻译：我们利用音频到图像生成管道探索声频潜在空间的可视化生成。我们相信这会有助于音频潜在空间的可解释性。我们在NSynth数据集上展示了各种结果。网络演示可供使用。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【图神经网络概览】《Graph Neural Networks - An overview | AI Summer》

【图神经网络概览】《Graph Neural Networks - An overview | AI Summer》

专知会员服务

54+阅读 · 2020年2月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

AMPK-Beclin-1/Vps34通路在维生素D3（Vit D)诱导足细胞自噬中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Cirbp在斑马鱼低温适应中的作用与机制研究

国家自然科学基金

1+阅读 · 2013年12月31日

顶点算子代数理论及李代数的表示

国家自然科学基金

1+阅读 · 2012年12月31日

关于AI-半环簇与 Conway半环簇的研究

国家自然科学基金

1+阅读 · 2012年12月31日

新型靶向VPAC1高亲和力多肽的结直肠癌分子显像应用基础研究

国家自然科学基金

0+阅读 · 2012年12月31日

移动视觉搜索关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

野油菜黄单胞菌全局性转录调控因子HpaR1致病的分子机理

国家自然科学基金

0+阅读 · 2011年12月31日

适应云计算环境的视频编码、传输与智能处理

国家自然科学基金

0+阅读 · 2011年12月31日

数字图像复原大规模问题的高性能算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

Shortcut Detection with Variational Autoencoders

Arxiv

0+阅读 · 2023年2月8日

CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models

Arxiv

0+阅读 · 2023年2月8日

Generating Synthetic Speech from SpokenVocab for Speech Translation

Arxiv

0+阅读 · 2023年2月8日

Explainable Action Prediction through Self-Supervision on Scene Graphs

Arxiv

0+阅读 · 2023年2月7日

High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets

Arxiv

0+阅读 · 2023年2月7日

Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation

Arxiv

0+阅读 · 2023年2月7日

Federated Variational Inference Methods for Structured Latent Variable Models

Arxiv

0+阅读 · 2023年2月7日

Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

Arxiv

0+阅读 · 2023年2月5日

Deep Latent State Space Models for Time-Series Generation

Arxiv

0+阅读 · 2023年2月3日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

VIP会员

文章信息

相关主题

计算学习理论

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【图神经网络概览】《Graph Neural Networks - An overview | AI Summer》

【图神经网络概览】《Graph Neural Networks - An overview | AI Summer》

专知会员服务

54+阅读 · 2020年2月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

NeurIPS 2025 | 自动化所新作速览（一）

大型语言模型（LLM）赋能的知识图谱构建：综述

NeurIPS 2025 | 自动化所新作速览（二）

领域特定文本分类中的预训练语言模型新进展：系统综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

Shortcut Detection with Variational Autoencoders

Arxiv

0+阅读 · 2023年2月8日

CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models

Arxiv

0+阅读 · 2023年2月8日

Generating Synthetic Speech from SpokenVocab for Speech Translation

Arxiv

0+阅读 · 2023年2月8日

Explainable Action Prediction through Self-Supervision on Scene Graphs

Arxiv

0+阅读 · 2023年2月7日

High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets

Arxiv

0+阅读 · 2023年2月7日

Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation

Arxiv

0+阅读 · 2023年2月7日

Federated Variational Inference Methods for Structured Latent Variable Models

Arxiv

0+阅读 · 2023年2月7日

Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

Arxiv

0+阅读 · 2023年2月5日

Deep Latent State Space Models for Time-Series Generation

Arxiv

0+阅读 · 2023年2月3日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

相关基金

AMPK-Beclin-1/Vps34通路在维生素D3（Vit D)诱导足细胞自噬中的作用和机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Cirbp在斑马鱼低温适应中的作用与机制研究

国家自然科学基金

1+阅读 · 2013年12月31日

顶点算子代数理论及李代数的表示

国家自然科学基金

1+阅读 · 2012年12月31日

关于AI-半环簇与 Conway半环簇的研究

国家自然科学基金

1+阅读 · 2012年12月31日

新型靶向VPAC1高亲和力多肽的结直肠癌分子显像应用基础研究

国家自然科学基金

0+阅读 · 2012年12月31日

移动视觉搜索关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

野油菜黄单胞菌全局性转录调控因子HpaR1致病的分子机理

国家自然科学基金

0+阅读 · 2011年12月31日

适应云计算环境的视频编码、传输与智能处理

国家自然科学基金

0+阅读 · 2011年12月31日

数字图像复原大规模问题的高性能算法研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员