研究标题：重视手语翻译中演讲者重合的重要性研究摘要：手语翻译，即识别某人是否在进行手语表达，对于远程会议软件的应用和选择有用的手语数据进行培训手语识别或翻译任务越发重要。本文认为当前手语翻译基准数据集过度乐观地估计了结果，没有很好地实现泛化，因为训练和测试分区之间的演讲者重叠。我们通过详细分析演讲者重叠对当前手语翻译基准数据集的影响来量化这一点。通过比较DGS语料库和Signing in the Wild的有重叠和没有重叠的准确性，我们观察到相对准确性下降了4.17％和6.27％，并提出了新的数据集分区，这些数据集不重叠，可以使性能评估更加现实。我们希望这项工作能有助于提高手语翻译系统的准确性和泛化性。 (On the Importance of Signer Overlap for Sign Language Detection) - 专知论文

会员服务 ·

0

模型评估 · 可辨认的 · Performer · 估计/估计量 · Analysis ·

2023 年 3 月 19 日

On the Importance of Signer Overlap for Sign Language Detection

翻译：研究标题：重视手语翻译中演讲者重合的重要性研究摘要：手语翻译，即识别某人是否在进行手语表达，对于远程会议软件的应用和选择有用的手语数据进行培训手语识别或翻译任务越发重要。本文认为当前手语翻译基准数据集过度乐观地估计了结果，没有很好地实现泛化，因为训练和测试分区之间的演讲者重叠。我们通过详细分析演讲者重叠对当前手语翻译基准数据集的影响来量化这一点。通过比较DGS语料库和Signing in the Wild的有重叠和没有重叠的准确性，我们观察到相对准确性下降了4.17％和6.27％，并提出了新的数据集分区，这些数据集不重叠，可以使性能评估更加现实。我们希望这项工作能有助于提高手语翻译系统的准确性和泛化性。

Abhilash Pal,Stephan Huber,Cyrine Chaabani,Alessandro Manzotti,Oscar Koller

Sign language detection, identifying if someone is signing or not, is becoming crucially important for its applications in remote conferencing software and for selecting useful sign data for training sign language recognition or translation tasks. We argue that the current benchmark data sets for sign language detection estimate overly positive results that do not generalize well due to signer overlap between train and test partitions. We quantify this with a detailed analysis of the effect of signer overlap on current sign detection benchmark data sets. Comparing accuracy with and without overlap on the DGS corpus and Signing in the Wild, we observed a relative decrease in accuracy of 4.17% and 6.27%, respectively. Furthermore, we propose new data set partitions that are free of overlap and allow for more realistic performance assessment. We hope this work will contribute to improving the accuracy and generalization of sign language detection systems.

翻译：

0

相关内容

模型评估

机器学习系统设计系统评估标准

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

专知会员服务

20+阅读 · 2020年7月1日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【论文推荐】将机器语言模型扩展到人类级别的语言理解，Extending Machine Language Models toward Human-Level Language Understanding

【论文推荐】将机器语言模型扩展到人类级别的语言理解，Extending Machine Language Models toward Human-Level Language Understanding

专知会员服务

18+阅读 · 2019年12月14日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

专知

17+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

专知

10+阅读 · 2018年4月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

以哇巴因为探针药物的肿瘤细胞分子伴侣介导的自噬信号转导途径研究

国家自然科学基金

0+阅读 · 2015年12月31日

广东话背景的失乐症者声调和音乐的发声和感知

国家自然科学基金

0+阅读 · 2015年12月31日

铁电材料液氦温区比热是否存在T^3/2贡献？

国家自然科学基金

0+阅读 · 2014年12月31日

P38 MAPK信号通路在S. boulardii预防DON诱导猪单核巨噬细胞凋亡的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

大豆抗镉和籽粒低积累的分子机理解析

国家自然科学基金

0+阅读 · 2012年12月31日

未熟-低熟油中金刚烷烃类化合物的成因及其潜在意义研究

国家自然科学基金

0+阅读 · 2012年12月31日

8细胞胚胎紧密化中多个信号通路对Ezrin磷酸化的调控

国家自然科学基金

0+阅读 · 2012年12月31日

搜索引擎广告的关键词筛选与竞价策略：考虑多重约束的理论模型与实证研究

国家自然科学基金

0+阅读 · 2011年12月31日

中国对虾"黄海2号"对WSSV抗性的遗传决定分析

国家自然科学基金

0+阅读 · 2011年12月31日

自交不亲和信号传递因子ARC1相互作用蛋白的筛选及功能分析

国家自然科学基金

0+阅读 · 2009年12月31日

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

Arxiv

0+阅读 · 2023年5月9日

Boosting Visual-Language Models by Exploiting Hard Samples

Arxiv

0+阅读 · 2023年5月9日

Child Palm-ID: Contactless Palmprint Recognition for Children

Arxiv

0+阅读 · 2023年5月9日

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

Arxiv

0+阅读 · 2023年5月8日

Inferring Features with Uncertain Roughness

Arxiv

0+阅读 · 2023年5月8日

On the Blind Spots of Model-Based Evaluation Metrics for Text Generation

Arxiv

0+阅读 · 2023年5月5日

The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

Arxiv

0+阅读 · 2023年5月4日

A Systematic Survey on Deep Generative Models for Graph Generation

Arxiv

18+阅读 · 2022年10月4日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

Deep Semantic Role Labeling with Self-Attention

Arxiv

13+阅读 · 2017年12月5日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

【ICML2020-Google】预训练提取的空白句子以便进行抽象摘要

专知会员服务

20+阅读 · 2020年7月1日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【论文推荐】将机器语言模型扩展到人类级别的语言理解，Extending Machine Language Models toward Human-Level Language Understanding

【论文推荐】将机器语言模型扩展到人类级别的语言理解，Extending Machine Language Models toward Human-Level Language Understanding

专知会员服务

18+阅读 · 2019年12月14日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

专知

17+阅读 · 2019年1月4日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

【论文推荐】最新八篇图像描述生成相关论文—比较级对抗学习、正则化RNNs、深层网络、视觉对话、婴儿说话、自我检索

专知

10+阅读 · 2018年4月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

相关论文

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

Arxiv

0+阅读 · 2023年5月9日

Boosting Visual-Language Models by Exploiting Hard Samples

Arxiv

0+阅读 · 2023年5月9日

Child Palm-ID: Contactless Palmprint Recognition for Children

Arxiv

0+阅读 · 2023年5月9日

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

Arxiv

0+阅读 · 2023年5月8日

Inferring Features with Uncertain Roughness

Arxiv

0+阅读 · 2023年5月8日

On the Blind Spots of Model-Based Evaluation Metrics for Text Generation

Arxiv

0+阅读 · 2023年5月5日

The Application of Affective Measures in Text-based Emotion Aware Recommender Systems

Arxiv

0+阅读 · 2023年5月4日

A Systematic Survey on Deep Generative Models for Graph Generation

Arxiv

18+阅读 · 2022年10月4日

Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding

Arxiv

12+阅读 · 2021年12月30日

Deep Semantic Role Labeling with Self-Attention

Arxiv

13+阅读 · 2017年12月5日

相关基金

以哇巴因为探针药物的肿瘤细胞分子伴侣介导的自噬信号转导途径研究

国家自然科学基金

0+阅读 · 2015年12月31日

广东话背景的失乐症者声调和音乐的发声和感知

国家自然科学基金

0+阅读 · 2015年12月31日

铁电材料液氦温区比热是否存在T^3/2贡献？

国家自然科学基金

0+阅读 · 2014年12月31日

P38 MAPK信号通路在S. boulardii预防DON诱导猪单核巨噬细胞凋亡的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

大豆抗镉和籽粒低积累的分子机理解析

国家自然科学基金

0+阅读 · 2012年12月31日

未熟-低熟油中金刚烷烃类化合物的成因及其潜在意义研究

国家自然科学基金

0+阅读 · 2012年12月31日

8细胞胚胎紧密化中多个信号通路对Ezrin磷酸化的调控

国家自然科学基金

0+阅读 · 2012年12月31日

搜索引擎广告的关键词筛选与竞价策略：考虑多重约束的理论模型与实证研究

国家自然科学基金

0+阅读 · 2011年12月31日

中国对虾"黄海2号"对WSSV抗性的遗传决定分析

国家自然科学基金

0+阅读 · 2011年12月31日

自交不亲和信号传递因子ARC1相互作用蛋白的筛选及功能分析

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员