Public knowledge of what is said in parliament is a tenet of democracy, and a critical resource for political science research. In Australia, following the British tradition, the written record of what is said in parliament is known as Hansard. While the Australian Hansard has always been publicly available, it has been difficult to use for the purpose of large-scale macro- and micro-level text analysis because it has only been available as PDFs or XMLs. Following the lead of the Linked Parliamentary Data project which achieved this for Canada, we provide a new, comprehensive, high-quality, rectangular database that captures proceedings of the Australian parliamentary debates from 1998 to 2022. The database is publicly available and can be linked to other datasets such as election results. The creation and accessibility of this database enables the exploration of new questions and serves as a valuable resource for both researchers and policymakers.
翻译:公众了解议会发言内容是民主的基本原则,也是政治科学研究的重要资源。在澳大利亚,遵循英国传统,议会发言记录的书面记录被称为汉萨德记录。虽然澳大利亚的汉萨德记录一直公开可用,但它一直很难用于大规模的文本分析,因为它只是作为PDF或XML格式提供。在加拿大会议数据链接项目的带领下,我们提供了一个新的、全面的、高质量的、矩形数据库,它捕捉了1998年至2022年的澳大利亚国会辩论记录。该数据库是公开可用的,并可与其他数据集(如选举结果)进行链接。该数据库的创建和可访问性使得可以探索新的问题,并为研究人员和政策制定者提供了宝贵的资源。