Sentiment analysis as a sub-field of natural language processing has received increased attention in the past decade enabling organisations to more effectively manage their reputation through online media monitoring. Many drivers impact reputation, however, this thesis focuses only the aspect of financial performance and explores the gap with regards to financial sentiment analysis in a South African context. Results showed that pre-trained sentiment analysers are least effective for this task and that traditional lexicon-based and machine learning approaches are best suited to predict financial sentiment of news articles. The evaluated methods produced accuracies of 84\%-94\%. The predicted sentiments correlated quite well with share price and highlighted the potential use of sentiment as an indicator of financial performance. A main contribution of the study was updating an existing sentiment dictionary for financial sentiment analysis. Model generalisation was less acceptable due to the limited amount of training data used. Future work includes expanding the data set to improve general usability and contribute to an open-source financial sentiment analyser for South African data.
翻译:过去十年来,作为自然语言处理的子领域,感官分析日益受到重视,使各组织能够通过在线媒体监测更有效地管理其声誉,但许多驱动因素对声誉产生影响,但许多驱动因素只关注财务业绩方面,并探讨在南非金融情绪分析方面存在的差距,结果显示,预先培训的情绪分析师对这项任务的效果最差,传统的词汇和机器学习方法最适于预测新闻文章的金融情绪。经过评价的方法产生了84 ⁇ -94 ⁇ 的便利。预测的情绪与股价相当接近,并突显了将情绪用作财务业绩指标的可能性。研究的主要贡献是更新了现有的金融情绪分析情绪词典。由于使用的培训数据数量有限,模型的概括性不太为人们所接受。未来的工作包括扩大数据集,以提高南非数据的一般可用性和有助于开放源金融情绪分析器。