LSRP：一种面向隐私保护云-端协作的领导者-从属检索框架 (LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration)

Cloud-device collaboration leverages on-cloud Large Language Models (LLMs) for handling public user queries and on-device Small Language Models (SLMs) for processing private user data, collectively forming a powerful and privacy-preserving solution. However, existing approaches often fail to fully leverage the scalable problem-solving capabilities of on-cloud LLMs while underutilizing the advantage of on-device SLMs in accessing and processing personalized data. This leads to two interconnected issues: 1) Limited utilization of the problem-solving capabilities of on-cloud LLMs, which fail to align with personalized user-task needs, and 2) Inadequate integration of user data into on-device SLM responses, resulting in mismatches in contextual user information. In this paper, we propose a Leader-Subordinate Retrieval framework for Privacy-preserving cloud-device collaboration (LSRP), a novel solution that bridges these gaps by: 1) enhancing on-cloud LLM guidance to on-device SLM through a dynamic selection of task-specific leader strategies named as user-to-user retrieval-augmented generation (U-U-RAG), and 2) integrating the data advantages of on-device SLMs through small model feedback Direct Preference Optimization (SMFB-DPO) for aligning the on-cloud LLM with the on-device SLM. Experiments on two datasets demonstrate that LSRP consistently outperforms state-of-the-art baselines, significantly improving question-answer relevance and personalization, while preserving user privacy through efficient on-device retrieval. Our code is available at: https://github.com/Applied-Machine-Learning-Lab/LSRP.

翻译：云-端协作利用云端大型语言模型（LLMs）处理公共用户查询，并借助端侧小型语言模型（SLMs）处理私有用户数据，共同构成一种强大且保护隐私的解决方案。然而，现有方法往往未能充分利用云端LLMs的可扩展问题解决能力，同时未能充分发挥端侧SLMs在访问和处理个性化数据方面的优势。这导致了两个相互关联的问题：1）云端LLMs的问题解决能力利用有限，未能与个性化的用户任务需求对齐；2）用户数据未能充分整合到端侧SLM的响应中，导致上下文用户信息不匹配。本文提出了一种面向隐私保护云-端协作的领导者-从属检索框架（LSRP），该新颖方案通过以下方式弥合这些差距：1）通过动态选择任务特定的领导者策略（称为用户到用户检索增强生成，U-U-RAG），增强云端LLM对端侧SLM的指导；2）通过小型模型反馈直接偏好优化（SMFB-DPO）整合端侧SLMs的数据优势，使云端LLM与端侧SLM对齐。在两个数据集上的实验表明，LSRP始终优于现有最先进的基线方法，显著提高了问答相关性和个性化程度，同时通过高效的端侧检索保护用户隐私。我们的代码发布于：https://github.com/Applied-Machine-Learning-Lab/LSRP。