Spelling variation (e.g. funnnn vs. fun) can influence the social perception of texts and their writers: we often have various associations with different forms of writing (is the text informal? does the writer seem young?). In this study, we focus on the social perception of spelling variation in online writing in English and study to what extent this perception is aligned between humans and large language models (LLMs). Building on sociolinguistic methodology, we compare LLM and human ratings on three key social attributes of spelling variation (formality, carefulness, age). We find generally strong correlations in the ratings between humans and LLMs. However, notable differences emerge when we analyze the distribution of ratings and when comparing between different types of spelling variation.
翻译:拼写变体(例如funnnn与fun)会影响文本及其作者的社会感知:我们通常对不同写作形式持有多种联想(文本是否非正式?作者是否显得年轻?)。本研究聚焦于在线英语写作中拼写变体的社会感知,并探究人类与大型语言模型(LLMs)在此感知上的对齐程度。基于社会语言学方法论,我们比较了LLMs与人类对拼写变体三个关键社会属性(正式性、细致度、年龄感)的评分。研究发现,人类与LLMs的评分总体上呈现强相关性。然而,在分析评分分布及比较不同类型拼写变体时,出现了显著差异。