ChatGPT能否复制人工生成的标签？对社交计算任务的研究 (Can ChatGPT Reproduce Human-Generated Labels? A Study of Social Computing Tasks)

The release of ChatGPT has uncovered a range of possibilities whereby large language models (LLMs) can substitute human intelligence. In this paper, we seek to understand whether ChatGPT has the potential to reproduce human-generated label annotations in social computing tasks. Such an achievement could significantly reduce the cost and complexity of social computing research. As such, we use ChatGPT to re-label five seminal datasets covering stance detection (2x), sentiment analysis, hate speech, and bot detection. Our results highlight that ChatGPT does have the potential to handle these data annotation tasks, although a number of challenges remain. ChatGPT obtains an average precision 0.609. Performance is highest for the sentiment analysis dataset, with ChatGPT correctly annotating 64.9% of tweets. Yet, we show that performance varies substantially across individual labels. We believe this work can open up new lines of analysis and act as a basis for future research into the exploitation of ChatGPT for human annotation tasks.

翻译：ChatGPT的发布揭示了大语言模型（LLMs）可以替代人类智能的各种可能性。在本文中，我们试图了解ChatGPT是否有潜力在社交计算任务中复制人工生成的标签注释。这样的成就可以极大地降低社交计算研究的成本和复杂性。因此，我们使用ChatGPT重新标记了五个语料库，涵盖态度检测（2x）、情感分析、仇恨言论和机器人检测。我们的结果表明，ChatGPT确实有处理这些数据注释任务的潜力，尽管仍存在许多挑战。 ChatGPT的平均精度为0.609。情感分析语料库的性能最好，ChatGPT可正确注释64.9％的推文。然而，我们表明性能在各个标签之间存在巨大的差异。我们相信这项工作可以开展新的分析线，并成为ChatGPT用于人工注释任务的未来研究的基础。

相关内容

ChatGPT

关注 257

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

从ChatGPT看AI未来趋势和挑战 | 万字长文

专知会员服务

174+阅读 · 2023年4月18日

【李老师400+页的ChatGPT全面介绍PPT】《ChatGPT的前世今生》

专知会员服务

173+阅读 · 2023年4月13日

【ACL2022-华盛顿大学】生成知识促进常识推理，Generated Knowledge Prompting for Commonsense Reasoning

专知会员服务

26+阅读 · 2022年3月1日

【ACM Multimedia2021-tutorial】可信赖多媒体分析

专知会员服务

18+阅读 · 2021年10月20日