During the last two years there has been a plethora of large generative models such as ChatGPT or Stable Diffusion that have been published. Concretely, these models are able to perform tasks such as being a general question and answering system or automatically creating artistic images that are revolutionizing several sectors. Consequently, the implications that these generative models have in the industry and society are enormous, as several job positions may be transformed. For example, Generative AI is capable of transforming effectively and creatively texts to images, like the DALLE-2 model; text to 3D images, like the Dreamfusion model; images to text, like the Flamingo model; texts to video, like the Phenaki model; texts to audio, like the AudioLM model; texts to other texts, like ChatGPT; texts to code, like the Codex model; texts to scientific texts, like the Galactica model or even create algorithms like AlphaTensor. This work consists on an attempt to describe in a concise way the main models are sectors that are affected by generative AI and to provide a taxonomy of the main generative models published recently.
翻译:在过去的两年里,出现了大量大型的基因模型,如ChatGPT或稳定传播等,这些模型已经出版。具体来说,这些模型能够执行一些任务,例如,一个一般性的问题和回答系统,或者自动制作艺术图像,使几个部门发生革命性的变化。因此,这些基因模型在工业和社会中的影响是巨大的,因为若干职位可能发生转变。例如,创世的AI能够有效和创造性地将文字转换成图像,如DALLE-2模型;文字到3D图像,如Dreamlution模型;文字到文字的文本,如Flamingo模型;图像到文字的文本,如Flamingo模型;视频的文本,如Phenaki模型;音频的文本,如音频LM模型;文本到其他文本,如ChatGPT;代码的文本,如代码模型;科学文本的文本,如Galactica模型,甚至像Alph Tensor这样的算法。这项工作包括试图以简洁的方式描述主要模型是受基因化AI影响的部门,并提供最近出版的主要基因模型的分类学。