GPT-4 Technical Report

We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.

Related content

GPT-4

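In the early hours of March 15, 2023 (Beijing time), OpenAI, the developer of ChatGPT, released GPT-4, a new multimodal pre-trained large model that is more reliable and more creative, and that can generate content from both image and text prompts. Compared with the previous generation of models, GPT-4 is a major step forward: it accepts image and text input and has strong image-recognition capabilities; it greatly raises the limit on text input, handling more than 25,000 words of text in ChatGPT mode, which lets it follow more detailed instructions; and its answers are markedly more accurate.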
