LLMInit：大型语言模型为推荐系统选择性初始化提供的免费午餐 (LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation)

Collaborative filtering (CF) is widely adopted in industrial recommender systems (RecSys) for modeling user-item interactions across numerous applications, but often struggles with cold-start and data-sparse scenarios. Recent advancements in pre-trained large language models (LLMs) with rich semantic knowledge, offer promising solutions to these challenges. However, deploying LLMs at scale is hindered by their significant computational demands and latency. In this paper, we propose a novel and scalable LLM-RecSys framework, LLMInit, designed to integrate pretrained LLM embeddings into CF models through selective initialization strategies. Specifically, we identify the embedding collapse issue observed when CF models scale and match the large embedding sizes in LLMs and avoid the problem by introducing efficient sampling methods, including, random, uniform, and variance-based selections. Comprehensive experiments conducted on multiple real-world datasets demonstrate that LLMInit significantly improves recommendation performance while maintaining low computational costs, offering a practical and scalable solution for industrial applications. To facilitate industry adoption and promote future research, we provide open-source access to our implementation at https://github.com/DavidZWZ/LLMInit.

翻译：协同过滤（CF）在工业推荐系统中被广泛用于建模用户-物品交互，适用于众多应用场景，但在冷启动和数据稀疏情况下往往表现不佳。近期，预训练大型语言模型（LLM）凭借其丰富的语义知识，为这些挑战提供了有前景的解决方案。然而，LLM的大规模部署受限于其高昂的计算需求和延迟。本文提出了一种新颖且可扩展的LLM-推荐系统框架——LLMInit，旨在通过选择性初始化策略将预训练LLM嵌入集成到CF模型中。具体而言，我们识别了当CF模型扩展至匹配LLM的大嵌入尺寸时出现的嵌入坍缩问题，并通过引入高效的采样方法（包括随机、均匀和基于方差的选取）来避免该问题。在多个真实世界数据集上进行的综合实验表明，LLMInit在保持低计算成本的同时，显著提升了推荐性能，为工业应用提供了实用且可扩展的解决方案。为促进业界采用和推动未来研究，我们在https://github.com/DavidZWZ/LLMInit开源了实现代码。