基于高斯过程回归与LLM相关性判别的多模态项目评分用于自然语言推荐 (Multimodal Item Scoring for Natural Language Recommendation via Gaussian Process Regression with LLM Relevance Judgments)

Natural Language Recommendation (NLRec) generates item suggestions based on the relevance between user-issued NL requests and NL item description passages. Existing NLRec approaches often use Dense Retrieval (DR) to compute item relevance scores from aggregation of inner products between user request embeddings and relevant passage embeddings. However, DR views the request as the sole relevance label, thus leading to a unimodal scoring function centered on the query embedding that is often a weak proxy for query relevance. To better capture the potential multimodal distribution of the relevance scoring function that may arise from complex NLRec data, we propose GPR-LLM that uses Gaussian Process Regression (GPR) with LLM relevance judgments for a subset of candidate passages. Experiments on four NLRec datasets and two LLM backbones demonstrate that GPR-LLM with an RBF kernel, capable of modeling multimodal relevance scoring functions, consistently outperforms simpler unimodal kernels (dot product, cosine similarity), as well as baseline methods including DR, cross-encoder, and pointwise LLM-based relevance scoring by up to 65%. Overall, GPR-LLM provides an efficient and effective approach to NLRec within a minimal LLM labeling budget.

翻译：自然语言推荐（NLRec）通过用户发出的自然语言请求与自然语言项目描述段落之间的相关性来生成项目建议。现有的NLRec方法通常使用密集检索（DR），通过聚合用户请求嵌入与相关段落嵌入的内积来计算项目相关性分数。然而，DR将请求视为唯一的关联标签，从而产生以查询嵌入为中心的单模态评分函数，这通常是对查询相关性的弱代理。为了更好地捕捉复杂NLRec数据中可能出现的相关性评分函数的多模态分布，我们提出了GPR-LLM，该方法利用高斯过程回归（GPR）结合对候选段落子集的LLM相关性判别。在四个NLRec数据集和两个LLM骨干网络上的实验表明，采用能够建模多模态相关性评分函数的RBF核的GPR-LLM，持续优于简单的单模态核（点积、余弦相似度），以及包括DR、交叉编码器和基于LLM的点式相关性评分在内的基线方法，性能提升高达65%。总体而言，GPR-LLM在最小的LLM标注预算内，为NLRec提供了一种高效且有效的方法。