在聋人和听力使用者和听力困难使用者的谈话分页中,使用BERT 模拟单词重要性 (Using BERT Embeddings to Model Word Importance in Conversational Transcripts for Deaf and Hard of Hearing Users)

Deaf and hard of hearing individuals regularly rely on captioning while watching live TV. Live TV captioning is evaluated by regulatory agencies using various caption evaluation metrics. However, caption evaluation metrics are often not informed by preferences of DHH users or how meaningful the captions are. There is a need to construct caption evaluation metrics that take the relative importance of words in a transcript into account. We conducted correlation analysis between two types of word embeddings and human-annotated labeled word-importance scores in existing corpus. We found that normalized contextualized word embeddings generated using BERT correlated better with manually annotated importance scores than word2vec-based word embeddings. We make available a pairing of word embeddings and their human-annotated importance scores. We also provide proof-of-concept utility by training word importance models, achieving an F1-score of 0.57 in the 6-class word importance classification task.

翻译：听力和听力困难的个人在观看现场电视时经常依赖字幕。现场电视字幕由监管机构使用各种字幕评价指标进行评估。但是,标题评价指标往往不因DHH用户的偏好或字幕的有意义的程度而了解。有必要构建在记录稿中考虑到文字相对重要性的字幕评价指标。我们在两种类型的单词嵌入和现有文体中贴有标签的文字重要性评分之间进行了相关分析。我们发现,使用BERT生成的标准化背景字嵌入比基于 word2vec 的单词嵌入的手动附加重要分数更好。我们提供配对的单词嵌入及其人附加重要分数。我们还通过培训名重要性模型提供证明概念的实用性,在6级词汇重要性分类任务中达到0.57的F1分。

相关内容

词向量表示

关注 37

分散式表示即将语言表示为稠密、低维、连续的向量。研究者最早发现学习得到词嵌入之间存在类比关系。比如apple−apples ≈ car−cars， man−woman ≈ king – queen 等。这些方法都可以直接在大规模无标注语料上进行训练。词嵌入的质量也非常依赖于上下文窗口大小的选择。通常大的上下文窗口学到的词嵌入更反映主题信息，而小的上下文窗口学到的词嵌入更反映词的功能和上下文语义信息。

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日