Real-time social media data can provide useful information on evolving hazards. Alongside traditional methods of disaster detection, the integration of social media data can considerably enhance disaster management. In this paper, we investigate the problem of detecting geolocation-content communities on Twitter and propose a novel distributed system that provides in near real-time information on hazard-related events and their evolution. We show that content-based community analysis leads to better and faster dissemination of reports on hazards. Our distributed disaster reporting system analyzes the social relationship among worldwide geolocated tweets, and applies topic modeling to group tweets by topics. Considering for each tweet the following information: user, timestamp, geolocation, retweets, and replies, we create a publisher-subscriber distribution model for topics. We use content similarity and the proximity of nodes to create a new model for geolocation-content based communities. Users can subscribe to different topics in specific geographical areas or worldwide and receive real-time reports regarding these topics. As misinformation can lead to increase damage if propagated in hazards related tweets, we propose a new deep learning model to detect fake news. The misinformed tweets are then removed from display. We also show empirically the scalability capabilities of the proposed system.
翻译:除了传统的灾害探测方法外,整合社交媒体数据可以大大加强灾害管理。在本文中,我们调查在推特上探测地理定位内容社区的问题,并提出一个以近实时信息提供灾害相关事件及其演变的新型分布系统。我们显示,基于内容的社区分析可以更好和更快地传播关于灾害的报告。我们分布式灾害报告系统分析全球地理定位的推文之间的社会关系,并将专题模型用于按主题对集体推文进行模拟。考虑到每一条推文的以下信息:用户、时间戳、地理定位、retweet和答复,我们为专题创建了一个出版商订阅器分发模式。我们使用内容相似和节点的近距离为基于地理定位的事件及其演变创建一个新的模式。用户可以认同特定地理区域或全球范围内的不同专题,并接收关于这些专题的实时报告。如果在与灾害有关的推文中传播,错误信息可能会增加损害。我们建议一个新的深层次学习模式来检测假消息。我们提出的推介的推文能力也从展示中移除。