如何获得资金?! 自动识别昆基科学成就 (How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements) - 专知论文

会员服务 ·

0

可辨认的 · state-of-the-art · Integration · NLP · 数据集 ·

2021 年 6 月 6 日

How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements

翻译：如何获得资金?! 自动识别昆基科学成就

Chen Shani,Nadav Borenstein,Dafna Shahaf

from arxiv, To be published in the main conference of ACL-IJCNLP2021. Code and dataset can be found here: https://github.com/nadavborenstein/Iggy

Humor is an important social phenomenon, serving complex social and psychological functions. However, despite being studied for millennia humor is computationally not well understood, often considered an AI-complete problem. In this work, we introduce a novel setting in humor mining: automatically detecting funny and unusual scientific papers. We are inspired by the Ig Nobel prize, a satirical prize awarded annually to celebrate funny scientific achievements (example past winner: "Are cows more likely to lie down the longer they stand?"). This challenging task has unique characteristics that make it particularly suitable for automatic learning. We construct a dataset containing thousands of funny papers and use it to learn classifiers, combining findings from psychology and linguistics with recent advances in NLP. We use our models to identify potentially funny papers in a large dataset of over 630,000 articles. The results demonstrate the potential of our methods, and more broadly the utility of integrating state-of-the-art NLP methods with insights from more traditional disciplines.

翻译：幽默是一种重要的社会现象,具有复杂的社会和心理功能。然而,尽管对千百年的幽默进行了研究,但是在计算上并没有很好地理解,常常被认为是一个完整的AI问题。在这项工作中,我们引入了幽默挖掘的新环境:自动发现有趣的和不寻常的科学论文。我们受到Ig 诺贝尔奖的启发,这是一个每年为庆祝有趣的科学成就而颁发的讽刺奖(例如,过去的胜利者“牛更可能躺得越久?” )。这项具有挑战性的任务具有独特性的特点,使得它特别适合自动学习。我们建立了一个数据集,包含数千篇有趣的论文,并用来学习分类学,将心理学和语言学的研究结果与NLP的最新进展结合起来。我们利用我们的模型在63万多篇文章的大型数据集中识别潜在有趣的论文。其结果显示了我们的方法的潜力,以及将最先进的NLP方法与较传统的学科的洞察力结合起来的效用。

0

相关内容

可辨认的

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

113+阅读 · 2021年4月17日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书】Python数据科学食谱（Python Data Science Cookbook）

【新书】Python数据科学食谱（Python Data Science Cookbook）

专知会员服务

117+阅读 · 2020年1月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

计算机类 | ISCC 2019等国际会议信息9条

计算机类 | ISCC 2019等国际会议信息9条

Call4Papers

5+阅读 · 2018年12月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

Measuring Disagreement in Science

Arxiv

0+阅读 · 2021年7月30日

An automated domain-independent text reading, interpreting and extracting approach for reviewing the scientific literature

Arxiv

0+阅读 · 2021年7月30日

Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks

Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks

Arxiv

0+阅读 · 2021年7月29日

Tab2Know: Building a Knowledge Base from Tables in Scientific Papers

Arxiv

0+阅读 · 2021年7月28日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

The Consciousness Prior

Arxiv

4+阅读 · 2019年12月2日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

8+阅读 · 2019年10月22日

Advances in Natural Language Question Answering: A Review

Advances in Natural Language Question Answering: A Review

Arxiv

5+阅读 · 2019年4月10日

End-to-End Learning for Answering Structured Queries Directly over Text

Arxiv

3+阅读 · 2018年11月16日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【如何做研究】How to research ，22页ppt

【如何做研究】How to research ，22页ppt

专知会员服务

113+阅读 · 2021年4月17日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【新书】Python数据科学食谱（Python Data Science Cookbook）

【新书】Python数据科学食谱（Python Data Science Cookbook）

专知会员服务

117+阅读 · 2020年1月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

计算机类 | ISCC 2019等国际会议信息9条

计算机类 | ISCC 2019等国际会议信息9条

Call4Papers

5+阅读 · 2018年12月25日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

相关论文

Measuring Disagreement in Science

Arxiv

0+阅读 · 2021年7月30日

An automated domain-independent text reading, interpreting and extracting approach for reviewing the scientific literature

Arxiv

0+阅读 · 2021年7月30日

Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks

Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks

Arxiv

0+阅读 · 2021年7月29日

Tab2Know: Building a Knowledge Base from Tables in Scientific Papers

Arxiv

0+阅读 · 2021年7月28日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

The Consciousness Prior

Arxiv

4+阅读 · 2019年12月2日

Challenges in Building Intelligent Open-domain Dialog Systems

Arxiv

8+阅读 · 2019年10月22日

Advances in Natural Language Question Answering: A Review

Advances in Natural Language Question Answering: A Review

Arxiv

5+阅读 · 2019年4月10日

End-to-End Learning for Answering Structured Queries Directly over Text

Arxiv

3+阅读 · 2018年11月16日

How do you correct run-on sentences it's not as easy as it seems

Arxiv

4+阅读 · 2018年9月21日

微信扫码咨询专知VIP会员