Recent neural language models have taken a significant step forward in producing remarkably controllable, fluent, and grammatical text. Although studies have found that AI-generated text is not distinguishable from human-written text for crowd-sourcing workers, there still exist errors in AI-generated text which are even subtler and harder to spot. We primarily focus on the scenario in which scientific AI writing assistant is deeply involved. First, we construct a feature description framework to distinguish between AI-generated text and human-written text from syntax, semantics, and pragmatics based on the human evaluation. Then we utilize the features, i.e., writing style, coherence, consistency, and argument logistics, from the proposed framework to analyze two types of content. Finally, we adopt several publicly available methods to investigate the gap of between AI-generated scientific text and human-written scientific text by AI-generated scientific text detection models. The results suggest that while AI has the potential to generate scientific content that is as accurate as human-written content, there is still a gap in terms of depth and overall quality. The AI-generated scientific content is more likely to contain errors in factual issues. We find that there exists a "writing style" gap between AI-generated scientific text and human-written scientific text. Based on the analysis result, we summarize a series of model-agnostic and distribution-agnostic features for detection tasks in other domains. Findings in this paper contribute to guiding the optimization of AI models to produce high-quality content and addressing related ethical and security concerns.
翻译:最近的神经语言模型在产生可明显控制、流畅和语法文本方面迈出了一大步。虽然研究发现AI产生的文本与众包工人的人类写成文本没有区别,但在AI产生的文本中仍然存在一些差错,这些差错甚至更微妙,更难发现。我们主要侧重于科学AI写作助理深入参与其中的情景。首先,我们建立了一个特征描述框架,根据人文评估,将AI产生的文本和人文写成的文本与语法、语义和务实文本区别开来。然后,我们利用拟议框架的特征,即写作风格、一致性、一致性和论证后勤,来分析两种内容。最后,我们采用几种公开可用的方法来调查AI产生的科学文本与由AI产生的科学文本探测模型编写的人类写作科学文本之间的差距。结果表明,尽管AI具有产生与人文写作内容一样准确的科学内容的潜力,但在深度和总体质量方面仍然存在差距。 AI产生的科学内容模型中的差距更可能包含真实的文本格式,我们发现“在复制的文本和复制结果方面”的序列中,我们发现“在复制的文本上存在着一种错误。