终端到终端语音翻译的非几何域域适应 (Non-Parametric Domain Adaptation for End-to-End Speech Translation) - 专知论文

会员服务 ·

0

语音翻译 · 端到端 · 训练数据 · 可约的 · 表示 ·

2022 年 10 月 26 日

Non-Parametric Domain Adaptation for End-to-End Speech Translation

翻译：终端到终端语音翻译的非几何域域适应

Yichao Du,Weizhi Wang,Zhirui Zhang,Boxing Chen,Tong Xu,Jun Xie,Enhong Chen

from arxiv, EMNLP 2022

End-to-End Speech Translation (E2E-ST) has received increasing attention due to the potential of its less error propagation, lower latency, and fewer parameters. However, the effectiveness of neural-based approaches to this task is severely limited by the available training corpus, especially for domain adaptation where in-domain triplet training data is scarce or nonexistent. In this paper, we propose a novel non-parametric method that leverages domain-specific text translation corpus to achieve domain adaptation for the E2E-ST system. To this end, we first incorporate an additional encoder into the pre-trained E2E-ST model to realize text translation modelling, and then unify the decoder's output representation for text and speech translation tasks by reducing the correspondent representation mismatch in available triplet training data. During domain adaptation, a k-nearest-neighbor (kNN) classifier is introduced to produce the final translation distribution using the external datastore built by the domain-specific text translation corpus, while the universal output representation is adopted to perform a similarity search. Experiments on the Europarl-ST benchmark demonstrate that when in-domain text translation data is involved only, our proposed approach significantly improves baseline by 12.82 BLEU on average in all translation directions, even outperforming the strong in-domain fine-tuning method.

翻译：终端到终端语音翻译(E2E-ST)因其传播错误较少、延迟度较低和参数较少而受到越来越多的关注。然而,现有培训资料严重限制了基于神经的方法对这项任务的有效性,特别是在内部三重培训数据稀缺或不存在的域性适应方面,现有培训资料严重限制了以神经为基础的方法对这方面任务的有效性,特别是在内部三重培训数据稀缺或不存在的域性适应方面。在本文件中,我们提出了一个新的非参数方法,利用特定域文本翻译资料,实现E2E-ST系统的域性适应。为此,我们首先将额外的编码器纳入经过预先培训的E2E-ST模型,以实现文本翻译建模,然后通过减少现有三重培训数据中的代理代表不匹配,统一文本和语音翻译任务的解码器输出。在进行域性调整时,我们引入了 k-earest-neighbor (kNNN) 分类,以便利用特定域文本翻译资料所建的外部数据库进行最终翻译分发,同时采用通用产出代表制来进行类似的搜索。甚至对EOP-ST的输出进行实验,在拟议的平均翻译方法中,只有BSustimal-L基准格式,因此,在拟议的翻译方法中要大幅度改进了我们所有基准中的数据方向。

0

相关内容

语音翻译

通过计算机进行不同语言之间的直接语音翻译，辅助不同语言背景的人们进行沟通已经成为世界各国研究的重点。和一般的文本翻译不同，语音翻译需要把语音识别、机器翻译和语音合成三大技术进行集成，具有很大的挑战性。

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

【MM 2021】VLAD-VSA: 基于词表分离和自适应的跨领域人脸欺诈检测，VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

【MM 2021】VLAD-VSA: 基于词表分离和自适应的跨领域人脸欺诈检测，VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

专知会员服务

5+阅读 · 2022年3月22日

【Nils Reimers】神经搜索的无监督域自适应，Unsupervised domain adaptation for neural search

【Nils Reimers】神经搜索的无监督域自适应，Unsupervised domain adaptation for neural search

专知会员服务

10+阅读 · 2022年3月8日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

近期必读的6篇CVPR 2020【域自适应（Domain Adaptation）】相关论文和代码

近期必读的6篇CVPR 2020【域自适应（Domain Adaptation）】相关论文和代码

专知会员服务

96+阅读 · 2020年3月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

CD2相关蛋白在阿尔茨海默病Tau蛋白介导的神经元损害中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于车体姿态预估的果园作业车主动安全控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

TiNi形状记忆合金表面W离子注入改性及其机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Rydberg Blockade条件下的量子相干与量子信息处理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

β钛合金中孪生诱导塑性(TWIP)效应

国家自然科学基金

0+阅读 · 2012年12月31日

铜螯合剂由抑制SOD1的活性调控胞内氧化还原信号转导

国家自然科学基金

0+阅读 · 2012年12月31日

春小麦淀粉形成对花后高温干旱的响应机制

国家自然科学基金

0+阅读 · 2011年12月31日

Fuzzy Domain 理论及其新拓扑工具研究

国家自然科学基金

0+阅读 · 2010年12月31日

铝通过Ca2+敏感的通路抑制Ⅰ#32452;mGluR依赖LTD机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

双组分信号转导系统CpxA/CpxR调控胸膜肺炎放线杆菌致病性机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

Trajectory Adaptive Prediction for Moving Objects in Uncertain Environment

Arxiv

0+阅读 · 2022年12月13日

Ensembling Transformers for Cross-domain Automatic Term Extraction

Arxiv

0+阅读 · 2022年12月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

A Survey of Domain Adaptation for Neural Machine Translation

Arxiv

18+阅读 · 2018年6月1日

Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss

Arxiv

10+阅读 · 2018年4月29日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

VIP会员

文章信息

相关主题

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

【MM 2021】VLAD-VSA: 基于词表分离和自适应的跨领域人脸欺诈检测，VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

【MM 2021】VLAD-VSA: 基于词表分离和自适应的跨领域人脸欺诈检测，VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation

专知会员服务

5+阅读 · 2022年3月22日

【Nils Reimers】神经搜索的无监督域自适应，Unsupervised domain adaptation for neural search

【Nils Reimers】神经搜索的无监督域自适应，Unsupervised domain adaptation for neural search

专知会员服务

10+阅读 · 2022年3月8日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

近期必读的6篇CVPR 2020【域自适应（Domain Adaptation）】相关论文和代码

近期必读的6篇CVPR 2020【域自适应（Domain Adaptation）】相关论文和代码

专知会员服务

96+阅读 · 2020年3月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

【论文推荐】最新八篇情感分析相关论文—Pair-wise判别器、多模态情感分析、上下文语境、Gated 卷积网络

专知

20+阅读 · 2018年6月29日

相关论文

Trajectory Adaptive Prediction for Moving Objects in Uncertain Environment

Arxiv

0+阅读 · 2022年12月13日

Ensembling Transformers for Cross-domain Automatic Term Extraction

Arxiv

0+阅读 · 2022年12月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

Arxiv

19+阅读 · 2021年4月19日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

Diverse Image-to-Image Translation via Disentangled Representations

Diverse Image-to-Image Translation via Disentangled Representations

Arxiv

13+阅读 · 2018年8月2日

A Survey of Domain Adaptation for Neural Machine Translation

Arxiv

18+阅读 · 2018年6月1日

Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss

Arxiv

10+阅读 · 2018年4月29日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

相关基金

CD2相关蛋白在阿尔茨海默病Tau蛋白介导的神经元损害中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

基于车体姿态预估的果园作业车主动安全控制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

TiNi形状记忆合金表面W离子注入改性及其机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

Rydberg Blockade条件下的量子相干与量子信息处理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

β钛合金中孪生诱导塑性(TWIP)效应

国家自然科学基金

0+阅读 · 2012年12月31日

铜螯合剂由抑制SOD1的活性调控胞内氧化还原信号转导

国家自然科学基金

0+阅读 · 2012年12月31日

春小麦淀粉形成对花后高温干旱的响应机制

国家自然科学基金

0+阅读 · 2011年12月31日

Fuzzy Domain 理论及其新拓扑工具研究

国家自然科学基金

0+阅读 · 2010年12月31日

铝通过Ca2+敏感的通路抑制Ⅰ#32452;mGluR依赖LTD机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

双组分信号转导系统CpxA/CpxR调控胸膜肺炎放线杆菌致病性机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员