As one of the most extensive social networking services, Twitter has more than 300 million active users as of 2022. Among its many functions, Twitter is now one of the go-to platforms for consumers to share their opinions about products or experiences, including flight services provided by commercial airlines. This study aims to measure customer satisfaction by analyzing sentiments of Tweets that mention airlines using a machine learning approach. Relevant Tweets are retrieved from Twitter's API and processed through tokenization and vectorization. After that, these processed vectors are passed into a pre-trained machine learning classifier to predict the sentiments. In addition to sentiment analysis, we also perform lexical analysis on the collected Tweets to model keywords' frequencies, which provide meaningful contexts to facilitate the interpretation of sentiments. We then apply time series methods such as Bollinger Bands to detect abnormalities in sentiment data. Using historical records from January to July 2022, our approach is proven to be capable of capturing sudden and significant changes in passengers' sentiment. This study has the potential to be developed into an application that can help airlines, along with several other customer-facing businesses, efficiently detect abrupt changes in customers' sentiments and take adequate measures to counteract them.
翻译:作为最广泛的社会网络服务之一,Twitter自2022年以来拥有3亿多活跃用户。Twitter的功能中,它现在已成为消费者交流产品或经验(包括商业航空公司提供的飞行服务)观点的热门平台之一。这项研究的目的是通过分析Tweets的情绪来衡量客户满意度,Tweets提到航空公司使用机器学习方法的情绪。相关的Tweets是从Twitter的API中检索的,并通过象征性化和病媒化处理。此后,这些经过加工的矢量被传递到一个预先培训的机器学习分类器中,以预测情绪。除了情绪分析外,我们还对收集的Tweets进行词汇分析,以模拟关键词频率,提供有意义的背景,以便利对情绪的解读。然后,我们运用Bollinger Bands等时间序列方法来检测情绪数据中的异常。使用2022年1月至7月的历史记录,我们的方法已证明能够捕捉到乘客情绪的突然和重大变化。这一研究有可能发展成一个应用程序,帮助航空公司和其他几个客户化企业一起帮助客户心智商,高效地检测顾客情绪的突然变化,并采取适当的措施。