Twitter is a microblogging service for sending short, public text messages (tweets) that has recently received more attention in scientific comunity. In the works of Sasaki et al. (2010) and Earle et al., (2011) the authors explored the real-time interaction on Twitter for detecting natural hazards (e.g., earthquakes, typhoons) baed on users' tweets. An inherent challenge for such an application is the natural language processing (NLP), which basically consists in converting the words in number (vectors and tensors) in order to (mathematically/ computationally) make predictions and classifications. Recently advanced computational tools have been made available for dealing with text computationally. In this report we implement a NLP machine learning with TensorFlow, an end-to-end open source plataform for machine learning applications, to process and classify evenct based on files containing only text.
翻译:Twitter是一种微型博客服务,用于发送短文本消息(推文),最近在科学界受到了更多的关注。在Sasaki等人(2010)和Earle等人的研究中,作者探讨了Twitter上的实时交互,以检测基于用户推文的自然灾害(例如地震、台风)等。这种应用的固有挑战是自然语言处理(NLP),其基本上是将单词转换为数字(向量和张量),以便进行预测和分类(数学/计算机)。最近,先进的计算工具已经可用于计算处理文本数据。在本报告中,我们使用TensorFlow进行NLP机器学习,这是一种端到端的开放源代码平台,用于处理只包含文本的文件并进行分类和预测。