ChatGPT has recently drawn great attention from both the research community and the public. We are particularly curious about whether it can serve as a universal sentiment analyzer. To this end, we provide a preliminary evaluation of ChatGPT's understanding of the opinions, sentiments, and emotions expressed in text. Specifically, we evaluate it in four settings: standard evaluation, polarity shift evaluation, open-domain evaluation, and sentiment inference evaluation. The evaluation covers 18 benchmark datasets and 5 representative sentiment analysis tasks, and we compare ChatGPT with fine-tuned BERT and the corresponding state-of-the-art (SOTA) models on each end task. We also conduct a human evaluation and present qualitative case studies to gain a deeper understanding of its sentiment analysis capabilities.