痛苦的提及在心理健康记录文本中的识别：一种自然语言处理方法 (Identifying Mentions of Pain in Mental Health Records Text: A Natural Language Processing Approach)

Pain is a common reason for accessing healthcare resources and is a growing area of research, especially in its overlap with mental health. Mental health electronic health records are a good data source to study this overlap. However, much information on pain is held in the free text of these records, where mentions of pain present a unique natural language processing problem due to its ambiguous nature. This project uses data from an anonymised mental health electronic health records database. The data are used to train a machine learning based classification algorithm to classify sentences as discussing patient pain or not. This will facilitate the extraction of relevant pain information from large databases, and the use of such outputs for further studies on pain and mental health. 1,985 documents were manually triple-annotated for creation of gold standard training data, which was used to train three commonly used classification algorithms. The best performing model achieved an F1-score of 0.98 (95% CI 0.98-0.99).

翻译：痛苦是访问医疗资源的常见原因，也是一个与心理健康重叠的研究领域。心理健康电子健康记录是研究此重叠的良好数据来源。然而，许多关于疼痛的信息都保存在这些记录的自由文本中，由于其模糊的性质，疼痛的提及会产生独特的自然语言处理问题。本项目使用来自匿名的心理健康电子健康记录数据库的数据。使用这些数据训练基于机器学习的分类算法来将句子分类为讨论患者疼痛与否。这将有助于从大型数据库中提取相关疼痛信息，并将这些输出用于进一步研究疼痛和心理健康。共手动三次注释1,985个文档以创建金标准训练数据，用于训练三种常用的分类算法。最佳性能模型的 F1 分数为 0.98（95% CI 0.98-0.99）。

相关内容

健康

关注 27

健康是指一个人在身体、精神和社会等方面都处于良好的状态。健康包括两个方面的内容：

一是主要脏器无疾病，身体形态发育良好，体形均匀，人体各系统具有良好的生理功能，有较强的身体活动能力和劳动能力，这是对健康最基本的要求；

二是对疾病的抵抗能力较强，能够适应环境变化，各种生理刺激以及致病因素对身体的作用。传统的健康观是“无病即健康”，现代人的健康观是整体健康，世界卫生组织提出“健康不仅是躯体没有疾病，还要具备心理健康、社会适应良好和有道德”。因此，现代人的健康内容包括：躯体健康、心理健康、心灵健康、社会健康、智力健康、道德健康、环境健康等。健康是人的基本权利。健康是人生的第一财富。

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日