A machine intelligence pipeline usually consists of six components: problem, representation, model, loss, optimizer and metric. Researchers have worked hard trying to automate many components of the pipeline. However, one key component of the pipeline--problem definition--is still left mostly unexplored in terms of automation. Usually, it requires extensive efforts from domain experts to identify, define and formulate important problems in an area. However, automatically discovering research or application problems for an area is beneficial since it helps to identify valid and potentially important problems hidden in data that are unknown to domain experts, expand the scope of tasks that we can do in an area, and even inspire completely new findings. This paper describes Problem Learning, which aims at learning to discover and define valid and ethical problems from data or from the machine's interaction with the environment. We formalize problem learning as the identification of valid and ethical problems in a problem space and introduce several possible approaches to problem learning. In a broader sense, problem learning is an approach towards the free will of intelligent machines. Currently, machines are still limited to solving the problems defined by humans, without the ability or flexibility to freely explore various possible problems that are even unknown to humans. Though many machine learning techniques have been developed and integrated into intelligent systems, they still focus on the means rather than the purpose in that machines are still solving human defined problems. However, proposing good problems is sometimes even more important than solving problems, because a good problem can help to inspire new ideas and gain deeper understandings. The paper also discusses the ethical implications of problem learning under the background of Responsible AI.
翻译:机器情报管道通常由六个组成部分组成:问题、代表性、模型、损失、优化和计量。研究人员努力设法使管道中许多组成部分自动化。然而,管道-问题定义的一个关键组成部分 -- -- 在自动化方面仍大多未探讨。通常,它需要领域专家作出大量努力,以查明、界定和提出一个领域的重要问题。然而,一个领域的自动发现研究或应用问题是有益的,因为它有助于查明数据中隐藏的、对域专家来说并不熟悉的有效和潜在的重要问题,扩大了我们可以在一个领域完成的任务的范围,甚至激发了全新的发现。本文描述了“问题学习”的一个关键组成部分,其目的是从数据或机器与环境的互动中发现和界定有效的和道德问题。我们把问题作为在问题空间查明、定义和提出重要问题,并采用几种可能的方法来学习问题。从更广的意义上讲,问题学习是走向智能机器的自由意志。目前,机器仍然局限于解决人类所定义的问题,没有能力或灵活性来自由探索数据或机器与环境的相互作用,因此,在智能的系统下,要学会各种可能的问题,而人类的解决的方法更是未知的。