独立说话人短语分割模型在端到端语音合成系统中的应用探究 (An investigation of speaker independent phrase break models in End-to-End TTS systems) - 专知论文

会员服务 ·

0

合成系统 · 合成 · 语音合成 · 端到端 · 分割 ·

2023 年 4 月 21 日

An investigation of speaker independent phrase break models in End-to-End TTS systems

翻译：独立说话人短语分割模型在端到端语音合成系统中的应用探究

Anandaswarup Vadapalli

from arxiv, Submitted for review to IEEE Access

This paper presents our work on phrase break prediction in the context of end-to-end TTS systems, motivated by the following questions: (i) Is there any utility in incorporating an explicit phrasing model in an end-to-end TTS system?, and (ii) How do you evaluate the effectiveness of a phrasing model in an end-to-end TTS system? In particular, the utility and effectiveness of phrase break prediction models are evaluated in in the context of childrens story synthesis, using listener comprehension. We show by means of perceptual listening evaluations that there is a clear preference for stories synthesized after predicting the location of phrase breaks using a trained phrasing model, over stories directly synthesized without predicting the location of phrase breaks.

翻译：本文在端到端语音合成系统中研究短语分割预测，研究动机为：（i）在端到端语音合成系统中引入显式短语模型是否具有效用？（ii）如何评估端到端语音合成系统中短语模型的有效性？具体来说，我们以儿童故事合成为背景，使用听众理解度来评估短语分割预测模型的效用和有效性。我们通过感知听评估表明，使用经过训练的短语模型预测短语断点位置合成的故事要优于直接合成的故事，而没有预测短语断点位置。

0

相关内容

合成系统

【2022新书】文本生成的深度学习方法，201页pdf，Deep Learning Approaches to Text Production

【2022新书】文本生成的深度学习方法，201页pdf，Deep Learning Approaches to Text Production

专知会员服务

39+阅读 · 2022年5月28日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

【Hinton新论文】语言建模目标检测Pix2seq

【Hinton新论文】语言建模目标检测Pix2seq

专知会员服务

26+阅读 · 2021年9月23日

【ACL2021】预训练语言模型的少样本知识图谱文本生成

专知会员服务

39+阅读 · 2021年6月6日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

专知会员服务

20+阅读 · 2020年7月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

专知

10+阅读 · 2018年4月12日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

融入偏好的公路多目标三维导向线生成模型及算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于主干成分的句法统计机器翻译模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

汉语句法分析中的自动歧义识别和分类问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多视角学习的情感分析理论与方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

不确定性推理与语义网中知识表示的数学基础

国家自然科学基金

18+阅读 · 2012年12月31日

基于概率化SC文法的多策略机器翻译研究

国家自然科学基金

0+阅读 · 2012年12月31日

力反馈与光学导航交互控制颅颌面外科手术辅助七自由度机器人的关键技术及算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多属性决策网MADN的仿真系统VV&A理论方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

维吾尔语框架语义角色自动标注技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于模型自适应修正和协同决策的说话人鲁棒语音情感识别方法研究

国家自然科学基金

0+阅读 · 2010年12月31日

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

Arxiv

0+阅读 · 2023年6月6日

Interactive Editing for Text Summarization

Arxiv

0+阅读 · 2023年6月5日

Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?

Arxiv

0+阅读 · 2023年6月5日

Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model

Arxiv

0+阅读 · 2023年6月5日

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning

Arxiv

0+阅读 · 2023年6月5日

Machine Learning Testing in an ADAS Case Study Using Simulation-Integrated Bio-Inspired Search-Based Testing

Arxiv

0+阅读 · 2023年6月4日

Multilingual Conceptual Coverage in Text-to-Image Models

Arxiv

0+阅读 · 2023年6月2日

Distilling Efficient Language-Specific Models for Cross-Lingual Transfer

Arxiv

0+阅读 · 2023年6月2日

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

Arxiv

0+阅读 · 2023年6月2日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】文本生成的深度学习方法，201页pdf，Deep Learning Approaches to Text Production

【2022新书】文本生成的深度学习方法，201页pdf，Deep Learning Approaches to Text Production

专知会员服务

39+阅读 · 2022年5月28日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

【CVPR 2022】多模态视频字幕的端到端生成预训练，End-to-end Generative Pretraining for Multimodal Video Captioning

专知会员服务

27+阅读 · 2022年3月3日

【Hinton新论文】语言建模目标检测Pix2seq

【Hinton新论文】语言建模目标检测Pix2seq

专知会员服务

26+阅读 · 2021年9月23日

【ACL2021】预训练语言模型的少样本知识图谱文本生成

专知会员服务

39+阅读 · 2021年6月6日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

【神经自然语言处理进展：建模，学习，推理】Progress in Neural NLP: Modeling, Learning, and Reasoning

专知会员服务

78+阅读 · 2020年8月13日

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

专知会员服务

20+阅读 · 2020年7月1日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

大语言模型时代的文档智能：综述

蜂窝通信是否是无人机与无人地面战车主宰战场的关键？

文档视觉问答简述

最新新Agent综述！76页327篇论文梳理，北交大桑基韬教授团队发布《迈向模型原生智能体式人工智能的范式转变综述》

相关资讯

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

专知

10+阅读 · 2018年4月12日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

相关论文

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

Arxiv

0+阅读 · 2023年6月6日

Interactive Editing for Text Summarization

Arxiv

0+阅读 · 2023年6月5日

Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?

Arxiv

0+阅读 · 2023年6月5日

Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model

Arxiv

0+阅读 · 2023年6月5日

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning

Arxiv

0+阅读 · 2023年6月5日

Machine Learning Testing in an ADAS Case Study Using Simulation-Integrated Bio-Inspired Search-Based Testing

Arxiv

0+阅读 · 2023年6月4日

Multilingual Conceptual Coverage in Text-to-Image Models

Arxiv

0+阅读 · 2023年6月2日

Distilling Efficient Language-Specific Models for Cross-Lingual Transfer

Arxiv

0+阅读 · 2023年6月2日

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

Arxiv

0+阅读 · 2023年6月2日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

相关基金

融入偏好的公路多目标三维导向线生成模型及算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于主干成分的句法统计机器翻译模型研究

国家自然科学基金

0+阅读 · 2013年12月31日

汉语句法分析中的自动歧义识别和分类问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多视角学习的情感分析理论与方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

不确定性推理与语义网中知识表示的数学基础

国家自然科学基金

18+阅读 · 2012年12月31日

基于概率化SC文法的多策略机器翻译研究

国家自然科学基金

0+阅读 · 2012年12月31日

力反馈与光学导航交互控制颅颌面外科手术辅助七自由度机器人的关键技术及算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于多属性决策网MADN的仿真系统VV&A理论方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

维吾尔语框架语义角色自动标注技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于模型自适应修正和协同决策的说话人鲁棒语音情感识别方法研究

国家自然科学基金

0+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员