Despite its great success, machine learning can have its limits when dealing with insufficient training data. A potential solution is the additional integration of prior knowledge into the training process which leads to the notion of informed machine learning. In this paper, we present a structured overview of various approaches in this field. We provide a definition and propose a concept for informed machine learning which illustrates its building blocks and distinguishes it from conventional machine learning. We introduce a taxonomy that serves as a classification framework for informed machine learning approaches. It considers the source of knowledge, its representation, and its integration into the machine learning pipeline. Based on this taxonomy, we survey related research and describe how different knowledge representations such as algebraic equations, logic rules, or simulation results can be used in learning systems. This evaluation of numerous papers on the basis of our taxonomy uncovers key methods in the field of informed machine learning.
翻译:尽管它取得了巨大成功,但机器学习在处理培训数据不足时却有其局限性。一个潜在的解决办法是将先前的知识进一步纳入培训过程,从而形成知情的机器学习概念。在本文件中,我们对该领域的各种办法进行了结构化的概述。我们为知情的机器学习提供了定义和概念,说明了其组成部分,并把它与常规的机器学习区别开来。我们引入了分类学框架,作为知情的机器学习方法的分类框架。它考虑了知识的来源、其代表性及其融入机器学习管道的情况。根据这一分类学,我们调查了相关的研究,并描述了如何在学习系统中使用代数方程式、逻辑规则或模拟结果等不同的知识表达方式。根据我们的分类法对大量论文进行了评估,发现了在知情的机器学习领域的关键方法。