News recommendation is a widely adopted technique to provide personalized news feeds for users. Recently, pre-trained language models (PLMs) have demonstrated great capability in natural language understanding and benefited news recommendation by improving news modeling. However, most existing works simply fine-tune the PLM on the news recommendation task, which may suffer from the well-known domain shift problem between the pre-training corpus and downstream news texts. Moreover, PLMs usually contain a large number of parameters and incur high computational overhead, which imposes a heavy burden on low-latency online services. In this paper, we propose Tiny-NewsRec, which can improve both the effectiveness and the efficiency of PLM-based news recommendation. We first design a self-supervised domain-specific post-training method to better adapt the general PLM to the news domain, using a contrastive matching task between news titles and news bodies. We further propose a two-stage knowledge distillation method to improve the efficiency of the large PLM-based news recommendation model while maintaining its performance. Multiple teacher models originating from different time steps of our post-training procedure are used to transfer comprehensive knowledge to the student in both its post-training and fine-tuning stages. Extensive experiments on two real-world datasets validate the effectiveness and efficiency of our method.
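To make the domain-specific post-training idea concrete, the following is a minimal sketch (not the authors' released code) of a contrastive title-body matching objective, assuming a Hugging Face PLM encoder, in-batch negatives, and an InfoNCE-style loss; the helper names `encode` and the `temperature` value are illustrative assumptions.

```python
# Sketch of title-body contrastive matching for domain-specific post-training.
# Assumptions: [CLS] pooling, in-batch negatives, InfoNCE loss.
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
plm = AutoModel.from_pretrained("bert-base-uncased")

def encode(texts, max_len):
    """Encode a list of texts into [CLS] embeddings with the PLM."""
    batch = tokenizer(texts, padding=True, truncation=True,
                      max_length=max_len, return_tensors="pt")
    return plm(**batch).last_hidden_state[:, 0]  # [CLS] token

def title_body_contrastive_loss(titles, bodies, temperature=0.05):
    """InfoNCE loss: each title should match its own body; the other
    bodies in the same batch serve as negatives."""
    t = F.normalize(encode(titles, max_len=32), dim=-1)
    b = F.normalize(encode(bodies, max_len=256), dim=-1)
    logits = t @ b.T / temperature           # (batch, batch) similarity matrix
    labels = torch.arange(len(titles))       # positive pairs lie on the diagonal
    return F.cross_entropy(logits, labels)
```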
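Similarly, a minimal sketch (an assumption, not the paper's exact formulation) of a multi-teacher distillation step: several teacher checkpoints taken from different post-training time steps supply soft targets, and the student optimizes a weighted combination of the distillation loss and the hard-label task loss. The uniform teacher weighting, `temperature`, and `alpha` here are illustrative choices.

```python
# Sketch of multi-teacher knowledge distillation with soft targets.
import torch
import torch.nn.functional as F

def multi_teacher_distill_loss(student_logits, teacher_logits_list,
                               labels, temperature=2.0, alpha=0.5):
    """Combine the hard-label task loss with the KL divergence to each
    teacher's softened output distribution (teachers weighted uniformly)."""
    task_loss = F.cross_entropy(student_logits, labels)
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    distill_loss = 0.0
    for t_logits in teacher_logits_list:
        soft_teacher = F.softmax(t_logits / temperature, dim=-1)
        distill_loss += F.kl_div(soft_student, soft_teacher,
                                 reduction="batchmean") * temperature ** 2
    distill_loss /= len(teacher_logits_list)
    return alpha * task_loss + (1 - alpha) * distill_loss
```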