Sarcasm Detection has enjoyed great interest from the research community, however the task of predicting sarcasm in a text remains an elusive problem for machines. Past studies mostly make use of twitter datasets collected using hashtag based supervision but such datasets are noisy in terms of labels and language. To overcome these shortcoming, we introduce a new dataset which contains news headlines from a sarcastic news website and a real news website. Next, we propose a hybrid Neural Network architecture with attention mechanism which provides insights about what actually makes sentences sarcastic. Through experiments, we show that the proposed model improves upon the baseline by ~ 5% in terms of classification accuracy.
翻译:讽刺的探测引起了研究界的极大兴趣,然而,在文本中预测讽刺性内容的任务对于机器来说仍是一个难以解决的问题。过去的研究大多利用以标签为基础的监督手段收集的Twitter数据集,但这类数据集在标签和语言方面是吵闹的。为了克服这些缺陷,我们引入了一个新的数据集,其中包含来自讽刺性新闻网站和真正的新闻网站的新闻头条。接下来,我们提议建立一个混合神经网络架构,其中含有关注机制,对是什么使判决具有讽刺性的内容提供洞察力。我们通过实验显示,拟议的模型在分类准确性方面比基线提高了5%。