Question: I have five fingers but I am not alive. What am I? Answer: a glove. Answering such a riddle-style question is a challenging cognitive process, in that it requires complex commonsense reasoning abilities, an understanding of figurative language, and counterfactual reasoning skills, which are all important abilities for advanced natural language understanding (NLU). However, there are currently no dedicated datasets aiming to test these abilities. Herein, we present RiddleSense, a new multiple-choice question answering task, which comes with the first large dataset (5.7k examples) for answering riddle-style commonsense questions. We systematically evaluate a wide range of models over the challenge, and point out that there is a large gap between the best-supervised model and human performance -- suggesting intriguing future research in the direction of higher-order commonsense reasoning and linguistic creativity towards building advanced NLU systems.
翻译:问题:我有五根手指,但我还活着。答案:我是什么?回答:手套。回答这样一个谜式的问题是一个具有挑战性的认知过程,因为它需要复杂的常识推理能力、对比喻语言的理解和反事实推理技能,这些都是先进的自然语言理解(NLU)的重要能力。然而,目前没有专门的数据组来测试这些能力。在这里,我们介绍一个新的多选择问题解答任务里德尔森塞,这是用来回答谜式常识问题的第一套大数据集(5.7k例),用来回答谜状常识问题。我们系统地评估了有关挑战的多种模型,并指出,最受监督的模式与人类表现之间存在巨大差距,表明未来研究的方向是更高的常识推理和语言创造性,以建设先进的NLU系统。