The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of human. Despite tremendous success in the AI research, most of existing methods have only single-cognitive ability. To overcome this limitation and take a solid step towards artificial general intelligence (AGI), we develop a foundation model pre-trained with huge multimodal data, which can be quickly adapted for various downstream cognitive tasks. To achieve this goal, we propose to pre-train our foundation model by self-supervised learning with weak semantic correlation data crawled from the Internet and show that promising results can be obtained on a wide range of downstream tasks. Particularly, with the developed model-interpretability tools, we demonstrate that strong imagination ability is now possessed by our foundation model. We believe that our work makes a transformative stride towards AGI, from our common practice of "weak or narrow AI" to that of "strong or generalized AI".
翻译:人工智能(AI)的根本目标是模仿人类的核心认知活动。尽管人工智能研究取得了巨大成功,但大多数现有方法只有单一认知能力。为了克服这一局限性,并朝着人造一般智能(AGI)迈出坚实的一步,我们开发了一个基础模型,先经过大量多式数据的培训,可以迅速适应各种下游认知任务。为了实现这一目标,我们提议通过自我监督学习,利用从互联网上爬出的薄弱的语义相关数据,对我们的基础模型进行预先培训,并表明在一系列广泛的下游任务中可以取得有希望的结果。特别是,通过开发的模型解释工具,我们证明我们现在拥有强大的想象力,而我们的基础模型现在已经具备了这种想象力。我们认为我们的工作从我们“弱小人工智能”的常见做法到“强大或普遍人工智能”的常见做法,向人工智能转变了我们的工作。