Great research interests have been attracted to devise AI services that are able to provide mental health support. However, the lack of corpora is a main obstacle to this research, particularly in Chinese language. In this paper, we propose PsyQA, a Chinese dataset of psychological health support in the form of question and answer pair. PsyQA is crawled from a Chinese mental health service platform, and contains 22K questions and 56K long and well-structured answers. Based on the psychological counseling theories, we annotate a portion of answer texts with typical strategies for providing support, and further present in-depth analysis of both lexical features and strategy patterns in the counseling answers. We also evaluate the performance of generating counseling answers with the generative pretrained models. Results show that utilizing strategies enhances the fluency and helpfulness of generated answers, but there is still a large space for future research.
翻译:在设计能够提供心理健康支持的AI服务方面,人们已经吸引了巨大的研究兴趣,然而,缺乏社团是这一研究的主要障碍,特别是中文研究的主要障碍。在本文中,我们建议采用问答方式,建立中国心理健康支持数据集,即PsyQA。PsyQA是从中国心理健康服务平台上爬出的,包含22K问题和56K长期和结构完善的答案。根据心理咨询理论,我们注意到有一部分回答文本,有提供支持的典型战略,并进一步深入分析咨询答案的词汇特征和战略模式。我们还评估了以基因化的预先培训模式产生咨询答案的绩效。结果显示,利用战略可以提高所产生答案的流畅和有用性,但今后仍有大量研究空间。