With the rapid advancement of information and communication technologies, many researchers have adopted alternative data sources from private data vendors to study human movement dynamics in response to large-scale natural or societal events. Big geosocial data such as georeferenced tweets are publicly available and dynamically evolving as real-world events are happening, making it more likely to capture the real-time sentiments and responses of populations. However, precisely-geolocated geosocial data is scarce and biased toward urban population centers. In this research, we developed a big geosocial data analytical framework for extracting human movement dynamics in response to large-scale events from publicly available georeferenced tweets. The framework includes a two-stage data collection module that collects data in a more targeted fashion in order to mitigate the data scarcity issue of georeferenced tweets; in addition, a variable bandwidth kernel density estimation(VB-KDE) approach was adopted to fuse georeference information at different spatial scales, further augmenting the signals of human movement dynamics contained in georeferenced tweets. To correct for the sampling bias of georeferenced tweets, we adjusted the number of tweets for different spatial units (e.g., county, state) by population. To demonstrate the performance of the proposed analytic framework, we chose an astronomical event that occurred nationwide across the United States, i.e., the 2017 Great American Eclipse, as an example event and studied the human movement dynamics in response to this event. However, this analytic framework can easily be applied to other types of large-scale events such as hurricanes or earthquakes.
翻译:随着信息和通信技术的迅速发展,许多研究人员采用了来自私营数据供应商的替代数据来源,以研究人类运动动态,以应对大规模自然或社会事件;大型地球社会数据,如地理参考推文,是公开的,随着现实世界事件的发生而动态变化;然而,精确地理定位的地球社会数据稀少,偏向城市人口中心;在这项研究中,我们开发了一个大型地球社会数据分析框架,以针对大规模事件,从公开提供的地理参照推文中提取人类运动动态;这个框架包括一个两阶段的数据收集模块,以更有针对性的方式收集数据,以减轻地理参考推文的数据稀缺问题;此外,它更有可能捕捉到实时人口动态的实时情绪和实时反应;精确地理定位的地理社会数据数据数据数据数据数据数据很少,进一步增强地理参照推文所载人类运动动态的信号;为了对大规模地震事件进行抽样分析,我们调整了不同空间单位(如地理参考推文、国家、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、州、