Conventional machine learning studies generally assume close world scenarios where important factors of the learning process hold invariant. With the great success of machine learning, nowadays, more and more practical tasks, particularly those involving open world scenarios where important factors are subject to change, called open environment machine learning (Open ML) in this article, are present to the community. Evidently it is a grand challenge for machine learning turning from close world to open world. It becomes even more challenging since, in various big data tasks, data are usually accumulated with time, like streams, while it is hard to train the machine learning model after collecting all data as in conventional studies. This article briefly introduces some advances in this line of research, focusing on techniques concerning emerging new classes, decremental/incremental features, changing data distributions, varied learning objectives, and discusses some theoretical issues.
翻译:常规机器学习研究通常假设世界环境接近,而学习过程的重要因素却始终存在。如今,随着机器学习的巨大成功,越来越多的、更加实际的任务,尤其是那些涉及开放世界、重要因素可能发生变化的情景,即本篇文章中称为开放环境机器学习(开放式ML)的任务,已经向社会展示。显然,这是机器学习从近距离世界转向开放世界的巨大挑战。由于在各种大数据任务中,数据通常随时间而积累,如流,而收集传统研究中的所有数据后,很难对机器学习模式进行培训。本文章简要介绍了这一研究领域的一些进展,重点是关于新兴班级、消化/入门特征、改变数据分布、不同学习目标以及讨论一些理论问题的技术。