TaL:一个同步的多声波超声波舌像成像、音频和唇音视频的多声频谱集 (TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos) - 专知论文

会员服务 ·

0

情景 · 语音合成 · 语音识别 · 近似 · CC ·

2020 年 11 月 19 日

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

翻译：TaL:一个同步的多声波超声波舌像成像、音频和唇音视频的多声频谱集

Manuel Sam Ribeiro,Jennifer Sanger,Jing-Xuan Zhang,Aciel Eshky,Alan Wrench,Korin Richmond,Steve Renals

from arxiv, 8 pages, 4 figures, Accepted to SLT2021, IEEE Spoken Language Technology Workshop

We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of English; TaL80 is a set of recording sessions of 81 native speakers of English without voice talent experience. Overall, the corpus contains 24 hours of parallel ultrasound, video, and audio data, of which approximately 13.5 hours are speech. This paper describes the corpus and presents benchmark results for the tasks of speech recognition, speech synthesis (articulatory-to-acoustic mapping), and automatic synchronisation of ultrasound to audio. The TaL corpus is publicly available under the CC BY-NC 4.0 license.

翻译：我们展示了“舌声和嘴唇声声”(TAL),这是一个多语种的音频、超声波舌成像和唇语视频库。TAL由两部分组成:TAL1是一套由一位专业语音人才(英语男性母语)组成的六次录音会议;TAL80是一套81个英语本地人(没有声音才经验)的录音会议。总体而言,TAL包含24小时的平行超声波、视频和音频数据,其中约13.5小时为演讲时间。本文描述了声音识别、语音合成(人工合成)和超声波自动同步工作的基本结果,根据CC BY-NC 4.0的许可证,可公开查阅TAL声波。

0

相关内容

多伦多大学Fall2020《机器学习导论》课程，不可错过！

专知会员服务

55+阅读 · 2020年10月11日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

109+阅读 · 2020年8月4日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

【Freddy Lecue 博士|公开演讲】可解释XAI人工智能进展（Explainable AI-The Story So Far），Sungkyunkwan University 2019

【Freddy Lecue 博士|公开演讲】可解释XAI人工智能进展（Explainable AI-The Story So Far），Sungkyunkwan University 2019

专知会员服务

32+阅读 · 2019年11月11日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

收藏 | 超全开源数据集，你真的不想要吗？（附链接）

收藏 | 超全开源数据集，你真的不想要吗？（附链接）

THU数据派

3+阅读 · 2018年9月17日

资源 | 一份非常全面的开源数据集

资源 | 一份非常全面的开源数据集

黑龙江大学自然语言处理实验室

10+阅读 · 2018年9月7日

五个精彩实用的自然语言处理资源

五个精彩实用的自然语言处理资源

机器学习研究会

6+阅读 · 2018年2月23日

深度学习、机器学习图像/人脸/字幕/自动驾驶数据集(Dataset)汇总

深度学习、机器学习图像/人脸/字幕/自动驾驶数据集(Dataset)汇总

数据挖掘入门与实战

3+阅读 · 2018年1月16日

Generating coherent spontaneous speech and gesture from text

Arxiv

0+阅读 · 2021年1月14日

Speaker activity driven neural speech extraction

Arxiv

0+阅读 · 2021年1月14日

End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN

Arxiv

0+阅读 · 2021年1月13日

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Arxiv

0+阅读 · 2021年1月13日

The Vulnerability of Semantic Segmentation Networks to Adversarial Attacks in Autonomous Driving: Enhancing Extensive Environment Sensing

Arxiv

0+阅读 · 2021年1月13日

Multimodal Engagement Analysis from Facial Videos in the Classroom

Arxiv

0+阅读 · 2021年1月11日

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

Arxiv

6+阅读 · 2020年10月26日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Improved Speech Enhancement with the Wave-U-Net

Arxiv

8+阅读 · 2018年11月27日

Topic Modelling of Everyday Sexism Project Entries

Arxiv

3+阅读 · 2018年4月5日

VIP会员

文章信息

相关主题

相关VIP内容

多伦多大学Fall2020《机器学习导论》课程，不可错过！

专知会员服务

55+阅读 · 2020年10月11日

迁移学习简明教程，11页ppt

迁移学习简明教程，11页ppt

专知会员服务

109+阅读 · 2020年8月4日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

【CCL 2019】ATT-第19期：文本生成 |Text Generation: From the Perspective of Interactive Inference （张家俊）

专知会员服务

43+阅读 · 2019年11月12日

【Freddy Lecue 博士|公开演讲】可解释XAI人工智能进展（Explainable AI-The Story So Far），Sungkyunkwan University 2019

【Freddy Lecue 博士|公开演讲】可解释XAI人工智能进展（Explainable AI-The Story So Far），Sungkyunkwan University 2019

专知会员服务

32+阅读 · 2019年11月11日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

自动驾驶轨迹规划中的基础模型：进展综述与开放挑战

《用于提升多域战备的大型语言模型辅助场景生成器》报告

【斯坦福博士论文】为人类使用优化 AI 模型

国防领域人工智能规模化应用的理论与实践

相关资讯

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

BERT/Transformer/迁移学习NLP资源大列表

BERT/Transformer/迁移学习NLP资源大列表

专知

19+阅读 · 2019年6月9日

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

BERT/注意力机制/Transformer/迁移学习NLP资源大列表：awesome-bert-nlp

AINLP

40+阅读 · 2019年6月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

收藏 | 超全开源数据集，你真的不想要吗？（附链接）

收藏 | 超全开源数据集，你真的不想要吗？（附链接）

THU数据派

3+阅读 · 2018年9月17日

资源 | 一份非常全面的开源数据集

资源 | 一份非常全面的开源数据集

黑龙江大学自然语言处理实验室

10+阅读 · 2018年9月7日

五个精彩实用的自然语言处理资源

五个精彩实用的自然语言处理资源

机器学习研究会

6+阅读 · 2018年2月23日

深度学习、机器学习图像/人脸/字幕/自动驾驶数据集(Dataset)汇总

深度学习、机器学习图像/人脸/字幕/自动驾驶数据集(Dataset)汇总

数据挖掘入门与实战

3+阅读 · 2018年1月16日

相关论文

Generating coherent spontaneous speech and gesture from text

Arxiv

0+阅读 · 2021年1月14日

Speaker activity driven neural speech extraction

Arxiv

0+阅读 · 2021年1月14日

End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN

Arxiv

0+阅读 · 2021年1月13日

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Arxiv

0+阅读 · 2021年1月13日

The Vulnerability of Semantic Segmentation Networks to Adversarial Attacks in Autonomous Driving: Enhancing Extensive Environment Sensing

Arxiv

0+阅读 · 2021年1月13日

Multimodal Engagement Analysis from Facial Videos in the Classroom

Arxiv

0+阅读 · 2021年1月11日

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

Arxiv

6+阅读 · 2020年10月26日

Scene Text Detection and Recognition: The Deep Learning Era

Scene Text Detection and Recognition: The Deep Learning Era

Arxiv

27+阅读 · 2019年9月5日

Improved Speech Enhancement with the Wave-U-Net

Arxiv

8+阅读 · 2018年11月27日

Topic Modelling of Everyday Sexism Project Entries

Arxiv

3+阅读 · 2018年4月5日

微信扫码咨询专知VIP会员