When humans perform inductive learning, they often enhance the process with background knowledge. With the increasing availability of well-formed collaborative knowledge bases, the performance of learning algorithms could be significantly enhanced if a way were found to exploit these knowledge bases. In this work, we present a novel algorithm for injecting external knowledge into induction algorithms using feature generation. Given a feature, the algorithm defines a new learning task over its set of values, and uses the knowledge base to solve the constructed learning task. The resulting classifier is then used as a new feature for the original problem. We have applied our algorithm to the domain of text classification using large semantic knowledge bases. We have shown that the generated features significantly improve the performance of existing learning algorithms.
翻译:当人类进行感化学习时,他们往往用背景知识来强化过程。随着形成良好的协作知识库的日益普及,如果找到一种利用这些知识库的方法,学习算法的性能可以大大提高。在这项工作中,我们提出了一个将外部知识注入到使用特性生成的感化算法中的新型算法。一个特征是,算法界定了一套新的学习任务,并使用知识库来解决所构建的学习任务。随后,由此形成的分类器被用作原始问题的新特征。我们利用大型语义知识库将我们的算法应用于文本分类领域。我们已经表明,所产生的特征大大改善了现有学习算法的性能。