Sarcasm can be defined as saying or writing the opposite of what one truly wants to express, usually to insult, irritate, or amuse someone. Because of the obscure nature of sarcasm in textual data, detecting it is difficult and of great interest to the sentiment analysis research community. Though the research in sarcasm detection spans more than a decade, some significant advancements have been made recently, including employing unsupervised pre-trained transformers in multimodal environments and integrating context to identify sarcasm. In this study, we aim to provide a brief overview of recent advancements and trends in computational sarcasm research for the English language. We describe relevant datasets, methodologies, trends, issues, challenges, and tasks relating to sarcasm that are beyond detection. Our study provides well-summarized tables of sarcasm datasets, sarcastic features and their extraction methods, and performance analysis of various approaches which can help researchers in related domains understand current state-of-the-art practices in sarcasm detection.
翻译:讽刺学可以被定义为说或写字与真正想要表达的相反,通常是为了侮辱、刺激或取笑某人。由于文字数据中讽刺的模糊性,因此很难发现它,而且令情绪分析研究界非常感兴趣。虽然讽刺学探测研究跨越了十多年,但最近取得了一些重大进步,包括在多式联运环境中使用未经监督的预先训练变压器,以及结合背景来辨别讽刺。在这项研究中,我们旨在简要概述英国语言计算讽刺学研究的最新进展和趋势。我们描述了与讽刺有关的数据集、方法、趋势、问题、挑战和任务,这些都无法探测出来。我们的研究提供了精美的讽刺学数据集、讽刺特征及其提取方法的图表,以及对各种方法的绩效分析,这些方法可以帮助相关领域的研究人员了解在讽刺学探测中的最新技术做法。