Despite the significant efforts made by the research community in recent years, automatically acquiring valuable information about high impact-events from social media remains challenging. We present EviDense, a graph-based approach for finding high-impact events (such as disaster events) in social media. One of the challenges we address in our work is to provide for each event a succinct keyword-based description, containing the most relevant information about it, such as what happened, the location, as well as its timeframe. We evaluate our approach on a large collection of tweets posted over a period of 19 months, using a crowdsourcing platform. Our evaluation shows that our method outperforms state-of-the-art approaches for the same problem, in terms of having higher precision, lower number of duplicates, and presenting a keyword-based description that is succinct and informative. We further improve the results of our algorithm by incorporating news from mainstream media. A preliminary version of this work was presented as a 4-pages short paper at ICWSM 2018.
翻译:尽管研究界近年来做出了巨大努力,但从社交媒体自动获得关于高影响活动的宝贵信息仍具有挑战性。我们介绍了EviDense,这是在社交媒体中找到高影响事件(如灾害事件)的图表式方法。我们在工作中要应对的挑战之一是为每个活动提供简明的关键词描述,其中载有关于它的最相关信息,例如所发生的事情、地点及其时间框架。我们利用一个众包平台评估了我们在19个月时间段内大量张贴的推特的收集方法。我们的评估表明,我们的方法在精确度更高、重复次数较少、并以关键词为基础描述简洁和资料丰富等方面,超过了对同一问题的最新方法。我们通过将主流媒体的新闻纳入到我们的工作算法的结果。我们的初步版本作为4页的短页论文在2018年ICWSMSM上作了介绍。