Talaia is a platform for monitoring social media and digital press. A configurable crawler gathers content with respect to user defined domains or topics. Crawled data is processed by means of IXA-pipes NLP chain and EliXa sentiment analysis system. A Django powered interface provides data visualization to provide the user analysis of the data. This paper presents the architecture of the system and describes in detail the different components of the system. To prove the validity of the approach, two real use cases are accounted for, one in the cultural domain and one in the political domain. Evaluation for the sentiment analysis task in both scenarios is also provided, showing the capacity for domain adaptation.
翻译:Talaia是一个监测社交媒体和数字媒体的平台,一个可配置爬行器收集与用户定义域或主题有关的内容,通过IXA-管道NLP链和EliXa情绪分析系统处理数据,Django驱动界面提供数据可视化,以提供用户对数据的分析,本文件介绍系统架构,并详细描述系统的不同组成部分,为证明方法的有效性,对两个实际使用案例进行了核算,一个在文化领域,另一个在政治领域,还评估了两种情景的情绪分析任务,显示了领域适应能力。