Multilingual speakers tend to alternate between languages within a conversation, a phenomenon referred to as "code-switching" (CS). CS is a complex phenomenon that not only encompasses linguistic challenges, but also contains a great deal of complexity in terms of its dynamic behaviour across speakers. This dynamic behaviour has been studied by sociologists and psychologists, identifying factors affecting CS. In this paper, we provide an empirical user study on Arabic-English CS, where we show the correlation between users' CS frequency and character traits. We use machine learning (ML) to validate the findings, informing and confirming existing theories. The predictive models were able to predict users' CS frequency with an accuracy higher than 55%, where travel experiences and personality traits played the biggest role in the modeling process.
翻译:多语言的发言者倾向于在对话中在不同语言之间互换,这是一种被称为“代码转换”的现象。 CS是一种复杂的现象,不仅包括语言挑战,而且从不同发言者的动态行为来看也包含许多复杂因素。这种动态行为已经由社会学家和心理学家研究过,确定了影响 CS 的因素。在本文中,我们提供了关于阿拉伯语-英语 CS 的经验性用户研究,其中显示了用户 CS 频率和性格特征之间的相互关系。我们用机器学习来验证调查结果,告知和确认现有的理论。预测模型能够准确预测用户 CS 频率超过55%,其中旅行经验和个性特征在建模过程中发挥了最大作用。