Catastrophic events create uncertain situations for humanitarian organizations locating and providing aid to affected people. Many people turn to social media during disasters for requesting help and/or providing relief to others. However, the majority of social media posts seeking help could not properly be detected and remained concealed because often they are noisy and ill-formed. Existing systems lack in planning an effective strategy for tweet preprocessing and grasping the contexts of tweets. This research, first of all, formally defines request tweets in the context of social networking sites, hereafter rweets, along with their different primary types and sub-types. Our main contributions are the identification and categorization of rweets. For rweet identification, we employ two approaches, namely a rule-based and logistic regression, and show their high precision and F1 scores. The rweets classification into sub-types such as medical, food, and shelter, using logistic regression shows promising results and outperforms existing works. Finally, we introduce an architecture to store intermediate data to accelerate the development process of the machine learning classifiers.
翻译:灾难性事件给人道主义组织寻找和向受影响者提供援助造成了不确定的情况。许多人在灾害期间求助于社交媒体请求帮助和(或)向他人提供救济。然而,大多数寻求帮助的社交媒体职位无法被正确发现,而且由于往往吵闹和不完善而一直隐藏起来。现有系统在为推文预处理和掌握推文背景制定有效战略方面缺乏规划。首先,这一研究在社会网络网站(下称Rweets)中正式界定了请求推文,以及它们的不同主要类型和子类型。我们的主要贡献是识别和分类rweet。关于rweet的识别,我们采用两种方法,即基于规则和后勤的回归,显示其高度精确性和F1分数。Rweets分类为医疗、食品和住房等子类型,使用后勤回归显示有希望的结果并超越了现有工作。最后,我们引入了一种结构,储存中间数据,以加速机器学习分类者的发展进程。</s>