低自行车语言编码的神经特征预测器和偏差残留编码 (Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding) - 专知论文

会员服务 ·

0

预测器/决策函数 · 代码 · 判别器 · 可约的 · Analysis ·

2022 年 11 月 4 日

Neural Feature Predictor and Discriminative Residual Coding for Low-Bitrate Speech Coding

翻译：低自行车语言编码的神经特征预测器和偏差残留编码

Haici Yang,Wootaek Lim,Minje Kim

Low and ultra-low-bitrate neural speech coding achieves unprecedented coding gain by generating speech signals from compact speech features. This paper introduces additional coding efficiency in neural speech coding by reducing the temporal redundancy existing in the frame-level feature sequence via a recurrent neural predictor. The prediction can achieve a low-entropy residual representation, which we discriminatively code based on their contribution to the signal reconstruction. The harmonization of feature prediction and discriminative coding results in a dynamic bit allocation algorithm that spends more bits on unpredictable but rare events. As a result, we develop a scalable, lightweight, low-latency, and low-bitrate neural speech coding system. We demonstrate the advantage of the proposed methods using the LPCNet as a neural vocoder. While the proposed method guarantees causality in its prediction, the subjective tests and feature space analysis show that our model achieves superior coding efficiency compared to LPCNet and Lyra V2 in the very low bitrates.

翻译：低位和超低位神经语言编码通过生成来自紧凑语言特征的语音信号,实现了前所未有的编码收益。本文引入了神经语言编码的额外编码效率,通过经常性神经预测器减少框架级特征序列中存在的时间冗余。预测可以实现低湿度残留代表制, 我们根据其对信号重建的贡献, 对其进行了区别对待的编码。地貌预测和歧视性编码的协调统一导致一种动态的位数分配算法, 将更多的位数花在不可预测但罕见的事件上。因此, 我们开发了一个可缩放、轻量、低纬度和低位线性神经语音编码系统。我们展示了将 LPCNet 用作神经动力编码器的拟议方法的优势。虽然拟议方法保证了其预测中的因果关系, 主观测试和特征空间分析表明,我们的模型实现了高于低位比特的LPCNet 和 Lyra V2 的编码效率。

0

相关内容

预测器/决策函数

预测器/决策函数

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

专知会员服务

40+阅读 · 2022年7月25日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

介孔复合微纳结构CaTi2O5的可控制备及光催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Metasurface的THz慢波器件研究

国家自然科学基金

0+阅读 · 2013年12月31日

单通道同频混合信号的盲可分离性、性能界及算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于PCE的多层多域光网络QoS组播路由多目标优化算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

金刚石上石墨烯的自组织生长与电学特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

硅基III-V族外延量子点RTD隧穿陀螺仪的基础效应研究

国家自然科学基金

0+阅读 · 2011年12月31日

SIRT2在神经细胞能量代谢损伤和神经炎症中的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

类石墨烯结构新型光催化材料的构筑及其降解有机污染物研究

国家自然科学基金

0+阅读 · 2011年12月31日

氮掺杂石墨烯基燃料电池催化剂的可控制备及其催化性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

Dyrk1A调控CaMKⅡ#948;的可变剪接及其在心脏重构过程中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error

Arxiv

0+阅读 · 2022年12月26日

Coded Caching Schemes for Two-dimensional Caching-aided Ultra-Dense Networks

Arxiv

0+阅读 · 2022年12月26日

A Convergence Rate for Manifold Neural Networks

Arxiv

0+阅读 · 2022年12月23日

DaDe: Delay-adaptive Detector for Streaming Perception

Arxiv

0+阅读 · 2022年12月23日

Incremental Predictive Coding: A Parallel and Fully Automatic Learning Algorithm

Arxiv

0+阅读 · 2022年11月16日

A Versatile Diffusion-based Generative Refiner for Speech Enhancement

Arxiv

0+阅读 · 2022年10月27日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

Adaptive Attentional Network for Few-Shot Knowledge Graph Completion

Arxiv

17+阅读 · 2020年10月19日

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Arxiv

14+阅读 · 2019年6月19日

NDDR-CNN: Layer-wise Feature Fusing in Multi-Task CNN by Neural Discriminative Dimensionality Reduction

Arxiv

15+阅读 · 2018年1月25日

VIP会员

文章信息

相关主题

预测器/决策函数

相关VIP内容

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

【图机器学习进展与趋势@ICML2022】Graph Machine Learning @ ICML 2022

专知会员服务

40+阅读 · 2022年7月25日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error

Arxiv

0+阅读 · 2022年12月26日

Coded Caching Schemes for Two-dimensional Caching-aided Ultra-Dense Networks

Arxiv

0+阅读 · 2022年12月26日

A Convergence Rate for Manifold Neural Networks

Arxiv

0+阅读 · 2022年12月23日

DaDe: Delay-adaptive Detector for Streaming Perception

Arxiv

0+阅读 · 2022年12月23日

Incremental Predictive Coding: A Parallel and Fully Automatic Learning Algorithm

Arxiv

0+阅读 · 2022年11月16日

A Versatile Diffusion-based Generative Refiner for Speech Enhancement

Arxiv

0+阅读 · 2022年10月27日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

Adaptive Attentional Network for Few-Shot Knowledge Graph Completion

Arxiv

17+阅读 · 2020年10月19日

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Arxiv

14+阅读 · 2019年6月19日

NDDR-CNN: Layer-wise Feature Fusing in Multi-Task CNN by Neural Discriminative Dimensionality Reduction

Arxiv

15+阅读 · 2018年1月25日

相关基金

介孔复合微纳结构CaTi2O5的可控制备及光催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Metasurface的THz慢波器件研究

国家自然科学基金

0+阅读 · 2013年12月31日

单通道同频混合信号的盲可分离性、性能界及算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于PCE的多层多域光网络QoS组播路由多目标优化算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

金刚石上石墨烯的自组织生长与电学特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

硅基III-V族外延量子点RTD隧穿陀螺仪的基础效应研究

国家自然科学基金

0+阅读 · 2011年12月31日

SIRT2在神经细胞能量代谢损伤和神经炎症中的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

类石墨烯结构新型光催化材料的构筑及其降解有机污染物研究

国家自然科学基金

0+阅读 · 2011年12月31日

氮掺杂石墨烯基燃料电池催化剂的可控制备及其催化性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

Dyrk1A调控CaMKⅡ#948;的可变剪接及其在心脏重构过程中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员