Public opinion reflects and shapes societal behavior, but the traditional survey-based tools to measure it are limited. We introduce a novel approach to probe media diet models -- language models adapted to online news, TV broadcast, or radio show content -- that can emulate the opinions of subpopulations that have consumed a set of media. To validate this method, we use as ground truth the opinions expressed in U.S. nationally representative surveys on COVID-19 and consumer confidence. Our studies indicate that this approach is (1) predictive of human judgements found in survey response distributions and robust to phrasing and channels of media exposure, (2) more accurate at modeling people who follow media more closely, and (3) aligned with literature on which types of opinions are affected by media consumption. Probing language models provides a powerful new method for investigating media effects, has practical applications in supplementing polls and forecasting public opinion, and suggests a need for further study of the surprising fidelity with which neural language models can predict human responses.
翻译:公共舆论反映并塑造着社会行为,但传统的基于调查的舆论测量工具存在局限性。我们引入了一种新的方法来探究媒体饮食模型——语言模型,这些模型适应于在线新闻、电视广播或广播节目内容,可以模拟已经使用了一组媒体的亚群体的观点。为了验证这种方法,我们以美国全国代表性调查中有关 COVID-19 和消费者信心的观点表述为基准真实性。我们的研究表明,这种方法(1)能够预测调查回答分布中人类判断的准确性和媒体暴露的措辞和渠道的稳健性,(2)对于模拟更密切关注媒体的人来说更准确,(3)与关于哪些观点受媒体消费影响的文献一致。探究语言模型提供了一种强大的新方法来研究媒体影响,在补充民意调查和预测公众舆论方面具有实际应用价值,并暗示了需要进一步研究神经语言模型以惊人的忠实程度来预测人类反应。