Measuring public opinion is a key focus during democratic elections, enabling candidates to gauge their popularity and alter their campaign strategies accordingly. Traditional survey polling remains the most popular estimation technique, despite its cost and time intensity, measurement errors, lack of real-time capabilities and lagged representation of public opinion. In recent years, Twitter opinion mining has attempted to combat these issues. Despite achieving promising results, it experiences its own set of shortcomings such as an unrepresentative sample population and a lack of long term stability. This paper aims to merge data from both these techniques using Bayesian data assimilation to arrive at a more accurate estimate of true public opinion for the Brexit referendum. This paper demonstrates the effectiveness of the proposed approach using Twitter opinion data and survey data from trusted pollsters. Firstly, the possible existence of a time gap of 16 days between the two data sets is identified. This gap is subsequently incorporated into a proposed assimilation architecture. This method was found to adequately incorporate information from both sources and measure a strong upward trend in Leave support leading up to the Brexit referendum. The proposed technique provides useful estimates of true opinion, which is essential to future opinion measurement and forecasting research.
翻译:衡量民意是民主选举期间的一个关键重点,使候选人能够评估其受欢迎程度并相应改变竞选战略。传统民意调查仍然是最受欢迎的估计技术,尽管其成本和时间密集、计量错误、缺乏实时能力以及民意代表滞后。近年来,在推特上挖掘舆论试图解决这些问题。尽管取得了可喜的成果,但它也经历了其自身的一系列缺陷,如缺乏代表性的抽样人口和长期稳定性。本文件的目的是利用巴耶斯数据同化,将这两种技术的数据合并起来,以便更准确地估计布雷希特全民投票的真正公众舆论。本文展示了使用Twitter舆论数据和可靠民意调查数据的拟议方法的有效性。首先,确定了两个数据集之间可能存在16天的时间差距。这一差距随后被纳入了拟议的同化结构。这一方法被认为充分纳入了来自两个来源的信息,并衡量了布雷西特全民投票前休假支持的强劲上升趋势。拟议方法提供了对真实舆论的有用估计,这是未来意见计量和预测研究的关键。