Recently, ChatGPT, along with DALL-E-2 and Codex,has been gaining significant attention from society. As a result, many individuals have become interested in related resources and are seeking to uncover the background and secrets behind its impressive performance. In fact, ChatGPT and other Generative AI (GAI) techniques belong to the category of Artificial Intelligence Generated Content (AIGC), which involves the creation of digital content, such as images, music, and natural language, through AI models. The goal of AIGC is to make the content creation process more efficient and accessible, allowing for the production of high-quality content at a faster pace. AIGC is achieved by extracting and understanding intent information from instructions provided by human, and generating the content according to its knowledge and the intent information. In recent years, large-scale models have become increasingly important in AIGC as they provide better intent extraction and thus, improved generation results. With the growth of data and the size of the models, the distribution that the model can learn becomes more comprehensive and closer to reality, leading to more realistic and high-quality content generation. This survey provides a comprehensive review on the history of generative models, and basic components, recent advances in AIGC from unimodal interaction and multimodal interaction. From the perspective of unimodality, we introduce the generation tasks and relative models of text and image. From the perspective of multimodality, we introduce the cross-application between the modalities mentioned above. Finally, we discuss the existing open problems and future challenges in AIGC.
翻译:最近,查特格普特公司与DALL-E-2和codx公司一道,得到了社会的极大关注,因此,许多个人开始对相关资源感兴趣,并正在设法发现其令人印象深刻的业绩背后的背景和秘密,事实上,查特格普特公司和其他创意AI(GAI)技术属于人工智能生成内容(AIGC)类别,该类技术涉及通过AI模型制作图像、音乐和自然语言等数字内容。AIGC的目标是使内容制作过程更加高效和方便,从而更快地制作高质量的内容。AIGC通过从人类提供的指示中提取和理解意向信息,并根据知识和意图信息生成内容。近年来,大型模型在AIGT(AI)公司中越来越重要,因为这些模型提供了更好的意图提取,从而改善了生成结果。随着数据的增长和模型的规模,该模型能够更全面和更接近于现实的传播,从而导致更现实和更高质量的内容生成。AIGC通过从人类提供的指示提取和理解意图信息的意向信息,并根据其知识和意图信息生成内容。在AIG(GA)中,从最新的交互式互动模式和基本任务中,从我们提到的最新动态分析中,从ADI模型和基本任务中,从最近的新进展中,从ADI格式和基本任务中,从ADML(我们提到,从A)的演变中,从ADLL)的演变中,从新的历史,从新的分析中,从新的分析中,从A/LLLLLLLLLLLLLLL),从新的分析,从新的分析中,从新的分析,从新的分析,从新的分析中,从新的分析,从AFLLLL(AF-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-I-</s>