June 8, 2021

Investigating label suggestions for opinion mining in German Covid-19 social media


Tilman Beck, Ji-Ung Lee, Christina Viehmann, Marcus Maurer, Oliver Quiring, Iryna Gurevych

From arXiv; to appear at ACL 2021

This work investigates the use of interactively updated label suggestions to improve the efficiency of gathering annotations for opinion mining in German Covid-19 social media data. We develop guidelines to conduct a controlled annotation study with social science students and find that suggestions from a model trained on a small, expert-annotated dataset already lead to a substantial improvement, in terms of inter-annotator agreement (+.14 Fleiss' $\kappa$) and annotation quality, compared to students who do not receive any label suggestions. We further find that label suggestions from interactively trained models do not lead to an improvement over suggestions from a static model. Nonetheless, our analysis of suggestion bias shows that annotators generally remain capable of reflecting upon the suggested label. Finally, we confirm the quality of the annotated data in transfer learning experiments between different annotator groups. To facilitate further research in opinion mining on social media data, we release our collected data consisting of 200 expert and 2,785 student annotations.
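The agreement figure in the abstract is reported as Fleiss' $\kappa$, which compares the observed agreement among several annotators with the agreement expected by chance. The following is a minimal sketch of the standard formula, not the authors' evaluation code; the function name `fleiss_kappa`, the input layout, and the toy labels `pro` and `con` are assumptions made for illustration, and the sketch assumes every item is labelled by the same number of annotators.

```python
from collections import Counter
from typing import Hashable, Sequence


def fleiss_kappa(ratings: Sequence[Sequence[Hashable]]) -> float:
    """Fleiss' kappa; ratings[i] holds the labels all annotators gave item i.

    Hypothetical helper for illustration (not taken from the paper).
    """
    if not ratings:
        raise ValueError("need at least one item")
    n_raters = len(ratings[0])
    if n_raters < 2:
        raise ValueError("need at least two annotators per item")
    if any(len(item) != n_raters for item in ratings):
        raise ValueError("every item needs the same number of annotators")

    n_items = len(ratings)
    totals: Counter = Counter()  # label counts over all items
    agreement_sum = 0.0          # sum of per-item agreement P_i

    for item in ratings:
        counts = Counter(item)
        totals.update(counts)
        # P_i: share of annotator pairs that agree on this item.
        squared = sum(c * c for c in counts.values())
        agreement_sum += (squared - n_raters) / (n_raters * (n_raters - 1))

    observed = agreement_sum / n_items
    # Chance agreement: sum of squared overall label proportions.
    expected = sum((c / (n_items * n_raters)) ** 2 for c in totals.values())
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one label is used")
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Toy data: three annotators, four items, made-up labels.
    toy = [
        ["pro", "pro", "pro"],
        ["con", "con", "con"],
        ["pro", "pro", "con"],
        ["con", "pro", "con"],
    ]
    print(f"Fleiss' kappa: {fleiss_kappa(toy):.3f}")
```

A reported gain such as +.14 would then be the difference between the values computed separately for two annotator groups on their respective annotations.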

