The key to the success of Question & Answer (Q&A) platforms is their users providing high-quality answers to the challenging questions posted across various topics of interest. For more than a decade, the expert finding problem has attracted much attention in information retrieval research. Motivated by gaps in expert identification across several Q&A portals, we investigate the feasibility of identifying data science experts on Reddit. Our method builds on manual coding results in which two data science experts labelled not only expert and non-expert comments but also out-of-scope comments, a novel contribution to the literature that enables the identification of additional groups of comments across web portals. We present a semi-supervised approach that combines 1,113 labelled comments with 100,226 unlabelled comments during training. The proposed model uses the activity behaviour of every user, drawing on Natural Language Processing (NLP), crowdsourced, and user feature sets. We conclude that the NLP and user feature sets contribute the most to the identification of these three classes, which suggests that the method can generalise well within the domain. Finally, we make a novel contribution by characterising different types of users on Reddit, which opens many future research directions.
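To make the semi-supervised setup concrete, the sketch below illustrates one way such a pipeline could be assembled. It is a minimal sketch, assuming scikit-learn's SelfTrainingClassifier as the semi-supervised learner and TF-IDF text features as a stand-in for the NLP feature set described above; the toy comments, labels, and threshold are hypothetical and not taken from the paper.

```python
# Minimal sketch of a semi-supervised comment classifier (expert / non-expert /
# out-of-scope). All data below are hypothetical placeholders for illustration.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.semi_supervised import SelfTrainingClassifier

# Hypothetical labelled comments (in the paper: 1,113 manually coded comments).
labelled_texts = [
    "You should cross-validate and report the variance of the scores.",  # expert
    "I think pandas is only for plotting, right?",                       # non-expert
    "What a great meme, thanks for sharing!",                            # out-of-scope
]
labelled_y = np.array([0, 1, 2])  # 0 = expert, 1 = non-expert, 2 = out-of-scope

# Hypothetical unlabelled comments (in the paper: 100,226 unlabelled comments).
# scikit-learn's convention marks unlabelled samples with -1.
unlabelled_texts = [
    "Try regularising the model before adding more features.",
    "Does anyone know a good tutorial for beginners?",
]

texts = labelled_texts + unlabelled_texts
y = np.concatenate([labelled_y, -np.ones(len(unlabelled_texts), dtype=int)])

# TF-IDF features over the comment text (one possible NLP feature set).
X = TfidfVectorizer(ngram_range=(1, 2)).fit_transform(texts)

# Self-training: the base classifier is iteratively refit, adding its own
# high-confidence predictions on the unlabelled comments as pseudo-labels.
model = SelfTrainingClassifier(LogisticRegression(max_iter=1000), threshold=0.9)
model.fit(X, y)
```

In practice, the TF-IDF block would be replaced or augmented with the crowdsourced and user activity features mentioned in the abstract, concatenated into a single feature matrix before training.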