Complaining is a speech act that expresses a negative inconsistency between reality and human expectations. While prior studies mostly focus on identifying the existence or the type of complaints, in this work, we present the first study in computational linguistics of measuring the intensity of complaints from text. Analyzing complaints from such perspective is particularly useful, as complaints of certain degrees may cause severe consequences for companies or organizations. We create the first Chinese dataset containing 3,103 posts about complaints from Weibo, a popular Chinese social media platform. These posts are then annotated with complaints intensity scores using Best-Worst Scaling (BWS) method. We show that complaints intensity can be accurately estimated by computational models with the best mean square error achieving 0.11. Furthermore, we conduct a comprehensive linguistic analysis around complaints, including the connections between complaints and sentiment, and a cross-lingual comparison for complaints expressions used by Chinese and English speakers. We finally show that our complaints intensity scores can be incorporated for better estimating the popularity of posts on social media.
翻译:虽然先前的研究主要侧重于确定投诉的存在或类型,但我们在这项工作中提出了衡量文本投诉强度的计算语言学的第一项研究。从这种角度分析投诉特别有用,因为某些程度的投诉可能会对公司或组织造成严重后果。我们创建了第一个中国数据集,其中包含3,103个关于中国广受欢迎的社交媒体平台Weibo投诉的立项。然后,这些文章用最差的缩放(BWS)方法用投诉强度评分附加了评分。我们表明,通过计算模型可以准确估计投诉强度,而最佳平均差错则达到0.11。此外,我们对投诉进行全面的语言分析,包括投诉和情绪之间的联系,以及对中英语发言人使用的投诉表达方式进行跨语言比较。我们最后表明,我们的投诉强度评分可以纳入,以更好地估计社交媒体职位的受欢迎程度。