Conventional machine learning studies generally assume close-environment scenarios where important factors of the learning process hold invariant. With the great success of machine learning, nowadays, more and more practical tasks, particularly those involving open-environment scenarios where important factors are subject to change, called open-environment machine learning (Open ML) in this article, are present to the community. Evidently it is a grand challenge for machine learning turning from close environment to open environment. It becomes even more challenging since, in various big data tasks, data are usually accumulated with time, like streams, while it is hard to train the machine learning model after collecting all data as in conventional studies. This article briefly introduces some advances in this line of research, focusing on techniques concerning emerging new classes, decremental/incremental features, changing data distributions, varied learning objectives, and discusses some theoretical issues.
翻译:常规机器学习研究一般假设近环境情景,学习过程的重要因素在其中始终存在。如今,随着机器学习的巨大成功,越来越多的、更加实际的任务,尤其是那些涉及开放环境情景的任务,其中一些重要因素可能发生变化,即本篇文章中所谓的开放环境机器学习(开放ML),已经向社区展示。显然,这是机器学习从近环境转向开放环境的一个重大挑战。由于在各种大数据任务中,数据通常随时间而积累,如流流,而在常规研究中收集所有数据后,很难对机器学习模式进行培训。这一文章简要介绍了这一研究领域的一些进展,重点是关于新兴班级、下降/入门特征、改变数据分布、不同学习目标的技术,并讨论一些理论问题。