LiteMuL: 使用多任务学习的轻量级上配置序列塔格 (LiteMuL: A Lightweight On-Device Sequence Tagger using Multi-task Learning)

Named entity detection and Parts-of-speech tagging are the key tasks for many NLP applications. Although the current state of the art methods achieved near perfection for long, formal, structured text there are hindrances in deploying these models on memory-constrained devices such as mobile phones. Furthermore, the performance of these models is degraded when they encounter short, informal, and casual conversations. To overcome these difficulties, we present LiteMuL - a lightweight on-device sequence tagger that can efficiently process the user conversations using a Multi-Task Learning (MTL) approach. To the best of our knowledge, the proposed model is the first on-device MTL neural model for sequence tagging. Our LiteMuL model is about 2.39 MB in size and achieved an accuracy of 0.9433 (for NER), 0.9090 (for POS) on the CoNLL 2003 dataset. The proposed LiteMuL not only outperforms the current state of the art results but also surpasses the results of our proposed on-device task-specific models, with accuracy gains of up to 11% and model-size reduction by 50%-56%. Our model is competitive with other MTL approaches for NER and POS tasks while outshines them with a low memory footprint. We also evaluated our model on custom-curated user conversations and observed impressive results.

翻译：命名实体检测和部分语音标签是许多NLP应用程序的关键任务。尽管目前最新的最新先进方法对于长期、正规、结构化文本来说几乎完美,但在将这些模型安装在移动电话等记忆限制的设备上存在障碍。此外,这些模型的性能在遇到短时间、非正式和临时交谈时会退化。为了克服这些困难,我们提出LiteMuL-一个轻量级的配置序列塔格,它可以使用多任务学习(MTL)方法有效处理用户对话。根据我们的最佳知识,拟议的模型是第一个用于序列标记的在设计MTL神经模型上实现的近乎完美。我们的LiteMult MuL模型规模约为2.39MBMB,在CONLLL2003数据集中实现了0.9433(NER)、0.909090(POS)的准确性能。拟议的LiteMult Mult不仅超越了当前艺术成果的状态,而且超过了我们拟议的具体任务模型的结果,其精确率达到11 % MTL神经神经模型,同时将我们的模型和模型缩小了我们的50个模式。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【COLING2020】无监督依存解析的综述论文，12页pdf

专知会员服务

16+阅读 · 2020年10月27日