Large Language Models (LLMs) are reshaping unsupervised learning by offering an unprecedented ability to perform text clustering based on their deep semantic understanding. However, their direct application is fundamentally limited by a lack of stateful memory for iterative refinement and the difficulty of managing cluster granularity. As a result, existing methods often rely on complex pipelines with external modules, sacrificing a truly end-to-end approach. We introduce LLM-MemCluster, a novel framework that reconceptualizes clustering as a fully LLM-native task. It leverages a Dynamic Memory to instill state awareness and a Dual-Prompt Strategy to enable the model to reason about and determine the number of clusters. Evaluated on several benchmark datasets, our tuning-free framework significantly and consistently outperforms strong baselines. LLM-MemCluster presents an effective, interpretable, and truly end-to-end paradigm for LLM-based text clustering.
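The abstract does not specify the implementation details of the framework; the following is a minimal illustrative sketch of the kind of memory-aware, two-prompt clustering loop it describes. The callable `call_llm`, the templates `ASSIGN_PROMPT` and `GRANULARITY_PROMPT`, and all parameter names are hypothetical placeholders introduced here for illustration, not the paper's actual prompts or code.

```python
# Illustrative sketch only: an iterative LLM clustering loop that carries
# cluster state ("dynamic memory") across rounds and uses two prompts, one
# for assignment and one for cluster granularity. All names are assumptions.
from typing import Callable, Dict, List

# Hypothetical prompt templates (assumptions, not taken from the paper).
ASSIGN_PROMPT = (
    "Current clusters (name: example members):\n{memory}\n\n"
    "Assign the following text to one existing cluster, or propose a new "
    "cluster name if none fits.\nText: {text}\nAnswer with the cluster name only."
)
GRANULARITY_PROMPT = (
    "Current clusters (name: example members):\n{memory}\n\n"
    "Should any two clusters be merged because they cover the same topic? "
    "Answer 'MERGE: <name_a> + <name_b>' or 'NONE'."
)


def format_memory(clusters: Dict[str, List[str]], k: int = 3) -> str:
    """Serialize the current cluster state (names plus a few members) for the prompt."""
    return "\n".join(f"{name}: {members[:k]}" for name, members in clusters.items())


def mem_cluster(texts: List[str],
                call_llm: Callable[[str], str],
                rounds: int = 2) -> Dict[str, List[str]]:
    """Cluster texts with an LLM, re-reading the cluster state on every pass."""
    clusters: Dict[str, List[str]] = {}
    for _ in range(rounds):
        # Assignment pass: each text is (re)assigned given the current memory.
        new_clusters: Dict[str, List[str]] = {}
        for text in texts:
            memory = format_memory(clusters) or "(none yet)"
            label = call_llm(ASSIGN_PROMPT.format(memory=memory, text=text)).strip()
            new_clusters.setdefault(label, []).append(text)
        clusters = new_clusters

        # Granularity pass: let the model decide whether the partition is too fine.
        decision = call_llm(GRANULARITY_PROMPT.format(memory=format_memory(clusters))).strip()
        if decision.upper().startswith("MERGE:"):
            parts = [s.strip() for s in decision[len("MERGE:"):].split("+")]
            if len(parts) == 2:
                a, b = parts
                if a in clusters and b in clusters and a != b:
                    clusters[a].extend(clusters.pop(b))
    return clusters
```

In this sketch the "memory" is simply a serialized table of cluster names and sample members that is injected into every prompt, giving the otherwise stateless model a view of its earlier decisions, while the second prompt separates granularity control (how many clusters to keep) from individual assignments. Any LLM client can be plugged in by passing a `prompt -> response` callable as `call_llm`.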