Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extended to robotics -- or to embodied intelligence more generally: systems which engage in a purposeful exchange of energy and information with a physical environment. In particular, the purview of embodied intelligent agents extends significantly beyond the typical considerations of main-stream machine learning approaches, which typically (i) do not consider operation under conditions significantly different from those encountered during training; (ii) do not consider the often substantial, long-lasting and potentially safety-critical nature of interactions during learning and deployment; (iii) do not require ready adaptation to novel tasks while at the same time (iv) effectively and efficiently curating and extending their models of the world through targeted and deliberate actions. In reality, therefore, these limitations result in learning-based systems which suffer from many of the same operational shortcomings as more traditional, engineering-based approaches when deployed on a robot outside a well defined, and often narrow operating envelope. Contrary to viewing embodied intelligence as another application domain for machine learning, here we argue that it is in fact a key driver for the advancement of machine learning technology. In this article our goal is to highlight challenges and opportunities that are specific to embodied intelligence and to propose research directions which may significantly advance the state-of-the-art in robot learning.
翻译:长期以来,机器学习已成为一项关键技术,在广泛的领域加速了科学和应用,因此,将学习方法应用于特定问题集的概念已成为推动特定领域的一个既定和宝贵的工作方式。在本条中,我们争辩说,这种方法并不直接延伸至机器人 -- -- 或更广义地体现情报:与物理环境有针对性地交流能源和信息的系统。特别是,内含智能剂的范围大大超出了主流机器学习方法的典型考虑范围,通常(一)不认为在与培训期间遇到的情况大不相同的条件下操作;(二)不认为学习和部署期间互动经常具有实质性、长期性和潜在安全-关键性质;(三)不要求随时适应新任务,而同时(四)通过有针对性和审慎的行动,有效和高效地调整和扩展其世界模式。因此,事实上,这些限制导致以学习为基础的系统在很多业务上存在许多与在培训期间遇到的情况大不相同的缺陷;(二)不认为,在学习和部署于明确界定的机器人时,往往认为,在操作过程中互动往往具有实质性的、长期和潜在的安全-关键-关键-关键-关键-关键-关键-是,我们从机器学习到学习到学习的另一种技术。