压缩定性的跨语言多任务模型 (Compressing Cross-Lingual Multi-Task Models at Qualtrics)

Experience management is an emerging business area where organizations focus on understanding the feedback of customers and employees in order to improve their end-to-end experiences. This results in a unique set of machine learning problems to help understand how people feel, discover issues they care about, and find which actions need to be taken on data that are different in content and distribution from traditional NLP domains. In this paper, we present a case study of building text analysis applications that perform multiple classification tasks efficiently in 12 languages in the nascent business area of experience management. In order to scale up modern ML methods on experience data, we leverage cross lingual and multi-task modeling techniques to consolidate our models into a single deployment to avoid overhead. We also make use of model compression and model distillation to reduce overall inference latency and hardware cost to the level acceptable for business needs while maintaining model prediction quality. Our findings show that multi-task modeling improves task performance for a subset of experience management tasks in both XLM-R and mBert architectures. Among the compressed architectures we explored, we found that MiniLM achieved the best compression/performance tradeoff. Our case study demonstrates a speedup of up to 15.61x with 2.60% average task degradation (or 3.29x speedup with 1.71% degradation) and estimated savings of 44% over using the original full-size model. These results demonstrate a successful scaling up of text classification for the challenging new area of ML for experience management.

翻译：风险管理是一个新兴商业领域,各组织侧重于了解客户和雇员的反馈,以改善其端到端经验。这导致了一系列独特的机器学习问题,帮助理解人们的感受,发现他们所关心的问题,并发现需要对传统NLP领域在内容和分布上不同的数据采取什么行动。在本文件中,我们介绍了在新经验管理新业务领域以12种语言高效执行多种分类任务的文本分析应用的案例研究。为了扩大现代ML方法的经验数据,我们利用多种语言和多任务模型技术,将我们的模型合并成一个单一部署以避免间接费用。我们还利用模型压缩和模型蒸馏方法,将总体推导力和硬件成本降低到商业需求可接受的水平,同时保持模型预测质量。我们的调查结果显示,多任务模型改进了XLM-R和mBert两个新业务领域一系列风险管理任务的业绩。在压缩结构中,我们探索了将MiniLMMA达到最佳压缩/业绩贸易规模,避免出现间接费用。我们的案例研究显示,MVIMM-L-L 71% 和44x 平均递增速度递增速度成本。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《校准自主性中的信任》2022最新16页slides

专知会员服务

20+阅读 · 2022年12月7日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日