Parliamentary and legislative debate transcripts provide an exciting insight into elected politicians' opinions, positions, and policy preferences. They are interesting for political and social sciences as well as linguistics and natural language processing (NLP). Exiting research covers discussions within individual parliaments. In contrast, we apply advanced NLP methods to a joint and comparative analysis of six national parliaments (Bulgarian, Czech, French, Slovene, Spanish, and United Kingdom) between 2017 and 2020, whose transcripts are a part of the ParlaMint dataset collection. Using a uniform methodology, we analyze topics discussed, emotions, and sentiment. We assess if the age, gender, and political orientation of speakers can be detected from speeches. The results show some commonalities and many surprising differences among the analyzed countries.
翻译:议会和立法辩论记录誊本对当选政治家的意见、立场和政策偏好提供了令人振奋的洞察力,对政治和社会科学以及语言和自然语言处理很感兴趣。退出研究涵盖个别议会内部的讨论。相比之下,我们采用先进的国家议会记录誊本方法对2017年至2020年期间的六个国家议会(保加利亚、捷克、法国、斯洛文尼亚、西班牙和联合王国)进行联合和比较分析,这些议会的记录是ParlaMint数据集收集的一部分。我们使用统一的方法分析讨论的议题、情绪和情绪。我们评估演讲者的年龄、性别和政治取向是否可从演讲中检测出来。结果显示受分析国家之间有一些共同之处和许多令人惊讶的差异。