The general reinforcement learning agent AIXI may be the only mathematical formalism of artificial general intelligence (AGI) supported by proof that its performance is optimal. This is achieved using compression as a proxy for intelligence. Unfortunately, AIXI is incomputable and claims of its optimality were later shown to be subjective. This paper proposes an alternative, supported by proof, which overcomes both problems. Integrating research from cognitive science (enactivism), philosophy (intension and extension), machine learning and planning (satplan), an arbitrary task is given mathematical rigour. This serves as an enactive model of learning and reasoning within which a description of intelligence is formalised. Instead of compression we use weakness as our proxy, and the result is both computable and objective. We formally prove that maximising weakness maximises intelligence. This proof is then further supported with experimental results comparing weakness and description length (the closest analogue to compression possible under the enactive model that would not reintroduce the problem of subjective performance). Our results show that weakness outperforms description length, and is a better proxy for intelligence. The foremost limitation is that intelligence as we have defined it is computationally complex, but may be coupled with domain specific inductive biases to make real-world domains of practical significance tractable (e.g. the domain of all tasks a human would undertake). Like AIXI this is not intended to be a panacea but to demonstrate useful principles, which may be integrated with existing tools such as neural networks to improve performance.
翻译:通用强化学习代理 AIXI 可能是人造一般智能的唯一数学形式形式主义, 并辅以其表现最佳的证明。 这是用压缩作为情报的替代物实现的。 不幸的是, AIXI是不可比较的, 其最佳性主张后来被证明是主观的。 本文提出了一个替代方案, 辅以证据, 克服了这两个问题。 将认知科学( 活动主义)、 哲学( 强化和扩展)、 机器学习和规划( 卫星计划) 的研究结合起来, 一个任意的任务被赋予数学的严谨性。 这可以作为学习和推理的生动模式, 其中将情报描述正规化。 我们不是将弱点压缩作为我们的代理,而是将结果压缩为可折中和客观的结果。 我们正式证明, 最大限度的弱点最大化将智能最大化智能。 然后用实验结果进一步支持, 比较弱点和描述长度( 最接近于压缩可能不会重新出现主观性表现问题的定型模型 ) 我们的结果显示, 弱点超越了描述的长度, 更准确地说, 也就是, 智能与实际的网络一样, 我们定义了它的真实性, 是一个精确的轨道 。