The SemanticWeb emerged as an extension to traditionalWeb, towards adding meaning (semantics) to a distributed Web of structured and linked data. At its core, the concept of ontology provides the means to semantically describe and structure information and data and expose it to software and human agents in a machine and human-readable form. For software agents to be realized, it is crucial to develop powerful artificial intelligence and machine learning techniques, able to extract knowledge from information and data sources and represent it in the underlying ontology. This survey aims to provide insight into key aspects of ontology-based knowledge extraction, from various sources such as text, images, databases and human expertise, with emphasis on the task of feature selection. First, some of the most common classification and feature selection algorithms are briefly presented. Then, selected methodologies, which utilize ontologies to represent features and perform feature selection and classification, are described. The presented examples span diverse application domains, e.g., medicine, tourism, mechanical and civil engineering, and demonstrate the feasibility and applicability of such methods.
翻译:文理学概念的核心是,本体学概念提供了对信息和数据进行语义描述和结构构建的手段,并以机器和人文可读的形式将其暴露于软件和人体代理物中。为了实现软件代理物,必须开发强大的人工智能和机器学习技术,能够从信息和数据来源中提取知识,并在基本本体学中加以体现。这项调查旨在深入了解基于本体知识的提取的关键方面,包括从文字、图像、数据库和人类专门知识等各种来源提取知识,重点是地物选择任务。首先,简要介绍了一些最常见的分类和地物选择算法。然后,介绍了一些方法,这些方法利用本体来代表特征并进行地物选择和分类。列举的例子涉及多种应用领域,例如医学、旅游、机械和土木工程,并展示了这些方法的可行性和适用性。