Theory-of-mind (ToM), a human ability to infer the intentions and thoughts of others, is an essential part of empathetic experiences. We provide here the framework for using NLP models to measure ToM expressed in written texts. For this purpose, we introduce ToM-Diary, a crowdsourced 18,238 diaries with 74,014 Korean sentences annotated with different ToM levels. Each diary was annotated with ToM levels by trained psychology students and reviewed by selected psychology experts. The annotators first divided the diaries based on whether they mentioned other people: self-focused and other-focused. Examples of self-focused sentences are "I am feeling good". The other-focused sentences were further classified into different levels. These levels differ by whether the writer 1) mentions the presence of others without inferring their mental state(e.g., I saw a man walking down the street), 2) fails to take the perspective of others (e.g., I don't understand why they refuse to wear masks), or 3) successfully takes the perspective of others (It must have been hard for them to continue working). We tested whether state-of-the-art transformer-based models (e.g., BERT) could predict underlying ToM levels in sentences. We found that BERT more successfully detected self-focused sentences than other-focused ones. Sentences that successfully take the perspective of others (the highest ToM level) were the most difficult to predict. Our study suggests a promising direction for large-scale and computational approaches for identifying the ability of authors to empathize and take the perspective of others. The dataset is at [URL](https://github.com/humanfactorspsych/covid19-tom-empathy-diary)
翻译:人类测算他人意图和思想的能力( ToM) 是人类测算他人意图和思想的能力, 是同情经历的一个基本部分。 我们在此提供了使用 NLP 模型来测量书面文本表达的 ToM 模型的框架 。 为此, 我们引入了 ToM-Dairy, 共18, 238 日记, 包含74, 014 韩元, 并附加了74, 014 韩元。 每本日记都由受过培训的心理学学生以 ToM 水平附加说明。 警告者首先根据是否提及其他人来区分日志: 以自我为焦点, 以其它人为焦点。 自重的句子示例是“我感觉良好 ” 。 其它重的句子被进一步分类为不同层次 。 我们测试了其他人的存在, 而没有推断他们的精神状态( 例如, 我看到一个人在街上行走)、 2 无法理解其他人的观点( 例如, 我无法理解他们为什么拒绝戴面具) 或3 成功理解其他人的观点( 我们最难理解他们选择的直觉方法 ) ( ) 以大的直观为模型/ 方向, 对于他们来说, 变动的模型, 更难于他们要继续运行 。