托尔赫-奥迪奥:用于音频和语音处理的建筑构件 (TorchAudio: Building Blocks for Audio and Speech Processing) - 专知论文

会员服务 ·

0

块 · Processing（编程语言） · Machine Learning · Performer · MoDELS ·

2022 年 2 月 16 日

TorchAudio: Building Blocks for Audio and Speech Processing

翻译：托尔赫-奥迪奥:用于音频和语音处理的建筑构件

Yao-Yuan Yang,Moto Hira,Zhaoheng Ni,Anjali Chourdia,Artyom Astafurov,Caroline Chen,Ching-Feng Yeh,Christian Puhrsch,David Pollack,Dmitriy Genzel,Donny Greenberg,Edward Z. Yang,Jason Lian,Jay Mahadeokar,Jeff Hwang,Ji Chen,Peter Goldsborough,Prabhat Roy,Sean Narenthiran,Shinji Watanabe,Soumith Chintala,Vincent Quenneville-Bélair,Yangyang Shi

from arxiv, Accepted by ICASSP 2022

This document describes version 0.10 of TorchAudio: building blocks for machine learning applications in the audio and speech processing domain. The objective of TorchAudio is to accelerate the development and deployment of machine learning applications for researchers and engineers by providing off-the-shelf building blocks. The building blocks are designed to be GPU-compatible, automatically differentiable, and production-ready. TorchAudio can be easily installed from Python Package Index repository and the source code is publicly available under a BSD-2-Clause License (as of September 2021) at https://github.com/pytorch/audio. In this document, we provide an overview of the design principles, functionalities, and benchmarks of TorchAudio. We also benchmark our implementation of several audio and speech operations and models. We verify through the benchmarks that our implementations of various operations and models are valid and perform similarly to other publicly available implementations.

翻译：本文件描述了TrchAudio的0.10版版本:音频和语音处理领域的机器学习应用程序的构件。TrchAudio的目标是通过提供现成的构件加速开发和部署研究人员和工程师的机器学习应用程序。这些构件的设计设计是GPU兼容、自动区分和可制作。TrchAudio可以很容易地从Python Info Invironment 数据库安装,源代码可在BSD-2-Clause许可证下公开查阅(截至2021年9月),网址是https://github.com/pytorch/audio。在本文件中,我们概述了TrchAudio的设计原则、功能和基准。我们还对若干音频操作和模型的实施进行了基准。我们通过基准核查各种操作和模型的实施是否有效,并与其他可公开使用的实施类似。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【2022新书】Transformer自然语言处理，Natural Language Processing with Transformers: Building Language Applications with Hugging Face

【2022新书】Transformer自然语言处理，Natural Language Processing with Transformers: Building Language Applications with Hugging Face

专知会员服务

523+阅读 · 2022年1月31日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

靶向代谢型谷氨酸受体8变构调节位点的3DShapeSim药物发现与镇痛作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

近藤格点系统中的自旋轨道耦合和拓扑量子态

国家自然科学基金

0+阅读 · 2014年12月31日

结合连锁和关联分析方法剖析玉米硝酸还原酶的遗传机制

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

低秩矩阵复原的Schatten-q(0<q<1)正则化理论与算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

关于AI-半环簇与 Conway半环簇的研究

国家自然科学基金

1+阅读 · 2012年12月31日

Ce-Pb-Sb-Te体系中相关相图及热物理性能的研究

国家自然科学基金

1+阅读 · 2012年12月31日

面向智能视频监控的高度多摄像机信息融合

国家自然科学基金

2+阅读 · 2009年12月31日

ROS在细菌抗生素应激中的作用及与毒素-抗毒素系统相关性的研究

国家自然科学基金

0+阅读 · 2009年12月31日

构建ASPP2/P53缺失的GP120转基因鼠研究HAD神经凋亡机制

国家自然科学基金

0+阅读 · 2008年12月31日

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

Arxiv

1+阅读 · 2022年4月20日

A Brief Guide to Designing and Evaluating Human-Centered Interactive Machine Learning

Arxiv

0+阅读 · 2022年4月20日

Natural Language Processing in-and-for Design Research

Arxiv

0+阅读 · 2022年4月17日

Prefix-Free Coding for LQG Control

Prefix-Free Coding for LQG Control

Arxiv

0+阅读 · 2022年4月15日

Graph Neural Networks: Methods, Applications, and Opportunities

Arxiv

93+阅读 · 2021年8月24日

Overcoming Catastrophic Forgetting in Graph Neural Networks

Arxiv

14+阅读 · 2020年12月10日

Spectral Clustering with Graph Neural Networks for Graph Pooling

Arxiv

25+阅读 · 2020年6月3日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

Graph Signal Processing -- Part I: Graphs, Graph Spectra, and Spectral Clustering

Arxiv

14+阅读 · 2019年8月12日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

VIP会员

文章信息

相关主题

Processing（编程语言）

Machine Learning

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【2022新书】Transformer自然语言处理，Natural Language Processing with Transformers: Building Language Applications with Hugging Face

【2022新书】Transformer自然语言处理，Natural Language Processing with Transformers: Building Language Applications with Hugging Face

专知会员服务

523+阅读 · 2022年1月31日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

Deep Research（深度研究）：系统性综述

《革新战术战场空间能力：反无人机系统》报告

【普林斯顿博士论文】用于语音的生成式通用模型

螺旋式开发作为战略资产：美军启示

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

Arxiv

1+阅读 · 2022年4月20日

A Brief Guide to Designing and Evaluating Human-Centered Interactive Machine Learning

Arxiv

0+阅读 · 2022年4月20日

Natural Language Processing in-and-for Design Research

Arxiv

0+阅读 · 2022年4月17日

Prefix-Free Coding for LQG Control

Prefix-Free Coding for LQG Control

Arxiv

0+阅读 · 2022年4月15日

Graph Neural Networks: Methods, Applications, and Opportunities

Arxiv

93+阅读 · 2021年8月24日

Overcoming Catastrophic Forgetting in Graph Neural Networks

Arxiv

14+阅读 · 2020年12月10日

Spectral Clustering with Graph Neural Networks for Graph Pooling

Arxiv

25+阅读 · 2020年6月3日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

Graph Signal Processing -- Part I: Graphs, Graph Spectra, and Spectral Clustering

Arxiv

14+阅读 · 2019年8月12日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

相关基金

靶向代谢型谷氨酸受体8变构调节位点的3DShapeSim药物发现与镇痛作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

近藤格点系统中的自旋轨道耦合和拓扑量子态

国家自然科学基金

0+阅读 · 2014年12月31日

结合连锁和关联分析方法剖析玉米硝酸还原酶的遗传机制

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

低秩矩阵复原的Schatten-q(0<q<1)正则化理论与算法研究

国家自然科学基金

1+阅读 · 2012年12月31日

关于AI-半环簇与 Conway半环簇的研究

国家自然科学基金

1+阅读 · 2012年12月31日

Ce-Pb-Sb-Te体系中相关相图及热物理性能的研究

国家自然科学基金

1+阅读 · 2012年12月31日

面向智能视频监控的高度多摄像机信息融合

国家自然科学基金

2+阅读 · 2009年12月31日

ROS在细菌抗生素应激中的作用及与毒素-抗毒素系统相关性的研究

国家自然科学基金

0+阅读 · 2009年12月31日

构建ASPP2/P53缺失的GP120转基因鼠研究HAD神经凋亡机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员