Visually Grounded Keyword Detection and Localisation for Low-Resource Languages - Zhuanzhi (专知) Papers

1 February 2023

Visually Grounded Keyword Detection and Localisation for Low-Resource Languages

Kayode Kolawole Olaleye

From arXiv: PhD dissertation, University of Stellenbosch, 108 pages, submitted and accepted 2023.

This study investigates the use of Visually Grounded Speech (VGS) models for keyword localisation in speech. It focusses on two main research questions: (1) Is keyword localisation possible with VGS models? (2) Can keyword localisation be done cross-lingually in a real low-resource setting? Four localisation methods are proposed and evaluated on an English dataset, with the best-performing method achieving an accuracy of 57%. A new dataset of spoken captions in the Yoruba language is also collected and released for cross-lingual keyword localisation. The cross-lingual model obtains a precision of 16% in actual keyword localisation, and this performance can be improved by initialising from a model pretrained on English data. The study presents a detailed analysis of the model's success and failure modes and highlights the challenges of using VGS models for keyword localisation in low-resource settings.
