Named entity detection and Parts-of-speech tagging are the key tasks for many NLP applications. Although the current state of the art methods achieved near perfection for long, formal, structured text there are hindrances in deploying these models on memory-constrained devices such as mobile phones. Furthermore, the performance of these models is degraded when they encounter short, informal, and casual conversations. To overcome these difficulties, we present LiteMuL - a lightweight on-device sequence tagger that can efficiently process the user conversations using a Multi-Task Learning (MTL) approach. To the best of our knowledge, the proposed model is the first on-device MTL neural model for sequence tagging. Our LiteMuL model is about 2.39 MB in size and achieved an accuracy of 0.9433 (for NER), 0.9090 (for POS) on the CoNLL 2003 dataset. The proposed LiteMuL not only outperforms the current state of the art results but also surpasses the results of our proposed on-device task-specific models, with accuracy gains of up to 11% and model-size reduction by 50%-56%. Our model is competitive with other MTL approaches for NER and POS tasks while outshines them with a low memory footprint. We also evaluated our model on custom-curated user conversations and observed impressive results.
翻译:命名实体检测和部分语音标签是许多NLP应用程序的关键任务。尽管目前最新的最新先进方法对于长期、正规、结构化文本来说几乎完美,但在将这些模型安装在移动电话等记忆限制的设备上存在障碍。此外,这些模型的性能在遇到短时间、非正式和临时交谈时会退化。为了克服这些困难,我们提出LiteMuL-一个轻量级的配置序列塔格,它可以使用多任务学习(MTL)方法有效处理用户对话。根据我们的最佳知识,拟议的模型是第一个用于序列标记的在设计MTL神经模型上实现的近乎完美。我们的LiteMult MuL模型规模约为2.39MBMB,在CONLLL2003数据集中实现了0.9433(NER)、0.909090(POS)的准确性能。拟议的LiteMult Mult不仅超越了当前艺术成果的状态,而且超过了我们拟议的具体任务模型的结果,其精确率达到11 % MTL神经神经模型,同时将我们的模型和模型缩小了我们的50个模式。