AIGenC: AI通过创造性概括 (AIGenC: AI generalisation via creativity)

This paper introduces a computational model of creative problem-solving in deep reinforcement learning agents, inspired by cognitive theories of creativity. The AIGenC model aims at enabling artificial agents to learn, use and generate transferable representations. AIGenC is embedded in a deep learning architecture that includes three main components: concept processing, reflective reasoning, and blending of concepts. The first component extracts objects and affordances from sensory input and encodes them in a concept space, represented as a hierarchical graph structure. Concept representations are stored in a dual memory system. Goal-directed and temporal information acquired by the agent during deep reinforcement learning enriches the representations creating a higher level of abstraction in the concept space. In parallel, a process akin to reflective reasoning detects and recovers from memory concepts relevant to the task according to a matching process that calculates a similarity value between the current state and memory graph structures. Once an interaction is finalised, rewards and temporal information are added to the graph structure, creating a higher abstraction level. If reflective reasoning fails to offer a suitable solution, a blending process comes into place to create new concepts by combining past information. We discuss the model's capability to yield better out-of-distribution generalisation in artificial agents, thus advancing toward artificial general intelligence. To the best of our knowledge, this is the first computational model, beyond mere formal theories, that posits a solution to creative problem solving within a deep learning architecture.

翻译：本文引入了在深强化学习媒介中创造性解决问题的计算模型,该模型的灵感来自认知的创造理论。AIGenC模型旨在使人工代理人能够学习、使用和产生可转移的演示。AIGenC嵌入一个包含三个主要组成部分的深层次学习结构中:概念处理、反思推理和概念混合。第一个组成部分从感官输入中提取对象和负担,并在概念空间中将它们编码成一个等级图结构。概念表示存储在双重记忆系统中。在深强化学习期间,由该代理人获得的目标指导和时间信息丰富了在概念空间中创造更高程度的抽象形象。同时,一个类似于反思推理的过程在与任务相关的记忆概念中探测和恢复的深层学习结构,根据一个计算当前状态和记忆图结构之间类似价值的匹配过程。一旦互动最终确定,奖励和时间信息被添加到图形结构中,创造出更高的抽象程度。如果反思推论无法提供合适的解决方案,那么一个简单的混合过程就会通过将过去的理论空间中的深层次概念创造出更深层次的概念。同时,一个类似于将过去的理论推算出一个模型,从而推算出我们总体的理论的模型的模型,从而推算出一个更深层次的理论的模型的模型的推算。