ChatGPT has become a global sensation. As ChatGPT and other large language models (LLMs) emerge, concerns about their misuse grow, including the dissemination of fake news, plagiarism, manipulation of public opinion, cheating, and fraud. Distinguishing AI-generated text from human-generated text has therefore become increasingly essential. Researchers have proposed various detection methodologies, ranging from basic binary classifiers to more complex deep-learning models. Some detection techniques rely on statistical characteristics or syntactic patterns, while others incorporate semantic or contextual information to improve accuracy. The primary objective of this study is to provide a comprehensive and up-to-date assessment of the latest techniques for ChatGPT detection. Additionally, we evaluate AI-generated-text detection tools that do not specifically claim to detect ChatGPT output, to gauge how well they perform on ChatGPT-generated content. For our evaluation, we curated a benchmark dataset of ChatGPT- and human-generated responses, covering diverse questions from the medical, open Q&A, and finance domains as well as user-generated answers from popular social networking platforms. The dataset serves as a reference for assessing how well various techniques detect ChatGPT-generated content. Our evaluation results demonstrate that none of the existing methods can effectively detect ChatGPT-generated content.
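To make the statistical-characteristics family of detectors concrete, the sketch below (an illustrative toy, not any of the detectors evaluated in this study; the feature names are our own) computes two simple text statistics that such classifiers commonly take as input features:

```python
import re

def text_features(text: str) -> dict:
    """Compute simple statistics of the kind statistical AI-text
    detectors often feed into a binary classifier (illustrative only)."""
    words = re.findall(r"[A-Za-z']+", text.lower())
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Average words per sentence: a crude proxy for syntactic complexity.
    avg_sentence_len = len(words) / max(len(sentences), 1)
    # Type-token ratio: lexical diversity (unique words / total words).
    type_token_ratio = len(set(words)) / max(len(words), 1)
    return {
        "avg_sentence_len": avg_sentence_len,
        "type_token_ratio": type_token_ratio,
    }

feats = text_features("This is a short sample. It has two sentences.")
```

In a real system, feature vectors like this would be fed to a trained classifier (e.g. logistic regression) rather than compared against hand-set thresholds; the point here is only the shape of the statistical approach.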