Mental disorders represent a critical global health challenge, and social media is increasingly viewed as a vital resource for real-time digital phenotyping and intervention. To leverage this data, large language models (LLMs) have been introduced, offering stronger semantic understanding and reasoning than traditional deep learning, thereby enhancing the explainability of detection results. Despite the growing prominence of LLMs in this field, there is a scarcity of scholarly works that systematically synthesize how advanced enhancement techniques, specifically Retrieval-Augmented Generation (RAG) and Agentic systems, can be utilized to address these reliability and reasoning limitations. Here, we systematically survey the evolving landscape of LLM-based methods for social media mental disorder analysis, spanning standard pre-trained language models, RAG to mitigate hallucinations and contextual gaps, and agentic systems for autonomous reasoning and multi-step intervention. We organize existing work by technical paradigm and clinical target, extending beyond common internalizing disorders to include psychotic disorders and externalizing behaviors. Additionally, the paper comprehensively evaluates the performance of LLMs, including the impact of RAG, across various tasks. This work establishes a unified benchmark for the field, paving the way for the development of trustworthy, autonomous AI systems that can deliver precise and explainable mental health support.
翻译:心理健康障碍是一项严峻的全球性健康挑战,而社交媒体日益被视为实现实时数字表型分析和干预的重要资源。为利用此类数据,大语言模型被引入该领域,其相比传统深度学习方法提供了更强的语义理解与推理能力,从而提升了检测结果的可解释性。尽管大语言模型在该领域的重要性日益凸显,但系统性地综述如何利用先进的增强技术——特别是检索增强生成和智能体系统——来解决其可靠性与推理局限性的学术著作仍较为匮乏。本文系统性地综述了基于大语言模型的社交媒体心理健康障碍分析方法的演进图景,涵盖标准预训练语言模型、用于缓解幻觉与上下文鸿沟的RAG技术,以及实现自主推理与多步骤干预的智能体系统。我们依据技术范式与临床目标对现有研究进行梳理,其范围超越常见的内化障碍,延伸至精神病性障碍与外化行为。此外,本文全面评估了大语言模型在多项任务中的性能表现,包括RAG技术的影响。本研究为该领域建立了统一的基准,为开发能够提供精准、可解释心理健康支持的可信自主人工智能系统铺平了道路。