As political attitudes have diverged ideologically in the United States, political speech has diverged lingusitically. The ever-widening polarization between the US political parties is accelerated by an erosion of mutual understanding between them. We aim to make these communities more comprehensible to each other with a framework that probes community-specific responses to the same survey questions using community language models CommunityLM. In our framework we identify committed partisan members for each community on Twitter and fine-tune LMs on the tweets authored by them. We then assess the worldviews of the two groups using prompt-based probing of their corresponding LMs, with prompts that elicit opinions about public figures and groups surveyed by the American National Election Studies (ANES) 2020 Exploratory Testing Survey. We compare the responses generated by the LMs to the ANES survey results, and find a level of alignment that greatly exceeds several baseline methods. Our work aims to show that we can use community LMs to query the worldview of any group of people given a sufficiently large sample of their social media discussions or media diet.
翻译:由于美国的政治态度在意识形态上存在差异,政治言论在语言上存在差异。美国政党之间日益扩大的两极分化因相互谅解的侵蚀而加速。我们的目标是使这些社区更能相互理解,通过一个框架,用社区语言模型来调查社区对同一调查问题的具体答复。在我们的框架内,我们通过Twitter为每个社区确定承诺的党派成员,并在他们所发的推特上微调LMs。然后,我们利用对相应LMs的迅速调查来评估这两个团体的世界观,并迅速征求对公众人物和由美国全国选举研究2020年探索性测试调查调查所调查的群体的意见。我们将LMS的反应与ARES调查结果进行比较,并找出大大超过若干基线方法的一致程度。我们的工作旨在显示,我们可以利用社区LMs来查询任何群体的世界观,他们的社会媒体讨论或媒体饮食的样本足够多。