In democratic countries, the ideology landscape is foundational to individual and collective political action; conversely, fringe ideology drives Ideologically Motivated Violent Extremism (IMVE). Therefore, quantifying ideology is a crucial first step to an ocean of downstream problems, such as; understanding and countering IMVE, detecting and intervening in disinformation campaigns, and broader empirical opinion dynamics modeling. However, online ideology detection faces two significant hindrances. Firstly, the ground truth that forms the basis for ideology detection is often prohibitively labor-intensive for practitioners to collect, requires access to domain experts and is specific to the context of its collection (i.e., time, location, and platform). Secondly, to circumvent this expense, researchers generate ground truth via other ideological signals (like hashtags used or politicians followed). However, the bias this introduces has not been quantified and often still requires expert intervention. This work presents an end-to-end ideology detection pipeline applicable to large-scale datasets. We construct context-agnostic and automatic ideological signals from widely available media slant data; show the derived pipeline is performant, compared to pipelines of common ideology signals and state-of-the-art baselines; employ the pipeline for left-right ideology, and (the more concerning) detection of extreme ideologies; generate psychosocial profiles of the inferred ideological groups; and, generate insights into their morality and preoccupations.
翻译:在民主国家中,意识形态景观对于个人和集体政治行动是基础性的;相反,边缘意识形态驱动着意识形态动机的暴力极端主义(IMVE)。因此,量化意识形态是一个重要的第一步,面向下游的问题是海量的,如理解和对抗IMVE,检测和干扰虚假信息活动,以及更广泛的经验意见动态建模。但是,在线意识形态检测面临两个重大障碍。首先,形成意识形态检测的基础的基本素材往往对于从业者收集来说过于费力,需要专业人士的支持,而且也要具体到其收集的上下文(例如时间、地点和平台)。其次,为了规避这种开支,研究人员通过其他意识形态信号(例如使用的hashtag或追随的政治家)来生成基础素材。但是,这种引入的偏差没有被量化,并且通常仍需要专业人员干预。本文提出了一种可应用于大规模数据集的端到端意识形态检测流程。我们从广泛可用的媒体倾向数据构建无上下文和自动的意识形态信号;展示了衍生的管道与常见意识形态信号和最先进的基线管道相比性能良好;将管道用于左右意识形态,以及(更为令人关注的)极端意识形态的检测;生成推断出的意识形态群体的心理社会概况;并且,生成了他们的道德和关注点的见解。