Experience management is an emerging business area where organizations focus on understanding the feedback of customers and employees in order to improve their end-to-end experiences. This results in a unique set of machine learning problems to help understand how people feel, discover issues they care about, and find which actions need to be taken on data that are different in content and distribution from traditional NLP domains. In this paper, we present a case study of building text analysis applications that perform multiple classification tasks efficiently in 12 languages in the nascent business area of experience management. In order to scale up modern ML methods on experience data, we leverage cross lingual and multi-task modeling techniques to consolidate our models into a single deployment to avoid overhead. We also make use of model compression and model distillation to reduce overall inference latency and hardware cost to the level acceptable for business needs while maintaining model prediction quality. Our findings show that multi-task modeling improves task performance for a subset of experience management tasks in both XLM-R and mBert architectures. Among the compressed architectures we explored, we found that MiniLM achieved the best compression/performance tradeoff. Our case study demonstrates a speedup of up to 15.61x with 2.60% average task degradation (or 3.29x speedup with 1.71% degradation) and estimated savings of 44% over using the original full-size model. These results demonstrate a successful scaling up of text classification for the challenging new area of ML for experience management.
翻译:风险管理是一个新兴商业领域,各组织侧重于了解客户和雇员的反馈,以改善其端到端经验。这导致了一系列独特的机器学习问题,帮助理解人们的感受,发现他们所关心的问题,并发现需要对传统NLP领域在内容和分布上不同的数据采取什么行动。在本文件中,我们介绍了在新经验管理新业务领域以12种语言高效执行多种分类任务的文本分析应用的案例研究。为了扩大现代ML方法的经验数据,我们利用多种语言和多任务模型技术,将我们的模型合并成一个单一部署以避免间接费用。我们还利用模型压缩和模型蒸馏方法,将总体推导力和硬件成本降低到商业需求可接受的水平,同时保持模型预测质量。我们的调查结果显示,多任务模型改进了XLM-R和mBert两个新业务领域一系列风险管理任务的业绩。在压缩结构中,我们探索了将MiniLMMA达到最佳压缩/业绩贸易规模,避免出现间接费用。 我们的案例研究显示,MVIMM-L-L 71% 和44x 平均递增速度递增速度成本。