使用大语言模式的缩写扩展 (Context-Aware Abbreviation Expansion Using Large Language Models) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · AAC · 模型评估 · 稳健性 ·

2022 年 5 月 11 日

Context-Aware Abbreviation Expansion Using Large Language Models

翻译：使用大语言模式的缩写扩展

Shanqing Cai,Subhashini Venugopalan,Katrin Tomanek,Ajit Narayanan,Meredith Ringel Morris,Michael P. Brenner

from arxiv, 15 pages, 7 figures, 8 tables. Accepted as a long paper at NAACL 2022

Motivated by the need for accelerating text entry in augmentative and alternative communication (AAC) for people with severe motor impairments, we propose a paradigm in which phrases are abbreviated aggressively as primarily word-initial letters. Our approach is to expand the abbreviations into full-phrase options by leveraging conversation context with the power of pretrained large language models (LLMs). Through zero-shot, few-shot, and fine-tuning experiments on four public conversation datasets, we show that for replies to the initial turn of a dialog, an LLM with 64B parameters is able to exactly expand over 70% of phrases with abbreviation length up to 10, leading to an effective keystroke saving rate of up to about 77% on these exact expansions. Including a small amount of context in the form of a single conversation turn more than doubles abbreviation expansion accuracies compared to having no context, an effect that is more pronounced for longer phrases. Additionally, the robustness of models against typo noise can be enhanced through fine-tuning on noisy data.

翻译：由于需要加快对有严重运动障碍的人的强化和替代交流(AAC)的文本输入,我们提出了一个模式,即以主要是字首字母的形式将短语缩写成主要为字首字母。我们的做法是利用预先培训的大型语言模型(LLMs)的力量来利用对话背景,将缩略语扩展为全句选项。通过零弹、微小和微调四个公开对话数据集的实验,我们显示,在对对话初始转弯的答复中,一个具有64B参数的LLM能够将缩写长度超过70%的短语完全扩展至10,从而在这些精确扩展中有效按键节约率达到大约77%。包括一个单一对话形式的小段次的缩略语扩展缩略语,而不是没有上下文的缩略语扩展。此外,通过对响声数据的微调微调整,可以提高模型的稳健性。

0

相关内容

语言模型化

语言模型化

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于NF-κB信号通路研究vaspin与leptin在骨性关节炎中的拮抗作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

片仔癀干预骨肉瘤干细胞ABC转运蛋白及PI3K/AKt信号通路逆转耐药的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

mTOR信号通路介导SIRT3调控糖尿病肾病系膜细胞肥大的作用及分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

Arxiv

0+阅读 · 2022年6月30日

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Arxiv

0+阅读 · 2022年6月30日

A comparative study of scoring systems by simulations

Arxiv

0+阅读 · 2022年6月28日

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Arxiv

0+阅读 · 2022年6月28日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

星链与未来战争

《黑蜂（Black Hummingbird）微型无人机》

《全球地缘政治环境中的反无人机系统互操作性》252页

《美国：为自动驾驶汽车铺平道路——未来出行已来》最新43页报告

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

Arxiv

0+阅读 · 2022年6月30日

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Arxiv

0+阅读 · 2022年6月30日

A comparative study of scoring systems by simulations

Arxiv

0+阅读 · 2022年6月28日

Extracting Targeted Training Data from ASR Models, and How to Mitigate It

Arxiv

0+阅读 · 2022年6月28日

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Arxiv

15+阅读 · 2020年5月13日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

内质网应激IRE1－XBP1S通路在高糖引起肾脏及系膜细胞发生氧化应激及损伤中的机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于NF-κB信号通路研究vaspin与leptin在骨性关节炎中的拮抗作用及分子机制

国家自然科学基金

0+阅读 · 2013年12月31日

片仔癀干预骨肉瘤干细胞ABC转运蛋白及PI3K/AKt信号通路逆转耐药的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

mTOR信号通路介导SIRT3调控糖尿病肾病系膜细胞肥大的作用及分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员