Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limitations including limited evaluation of subsumption mappings, suboptimal reference mappings, and limited support for the evaluation of ML-based systems. To tackle these limitations, we introduce five new biomedical OM tasks involving ontologies extracted from Mondo and UMLS. Each task includes both equivalence and subsumption matching; the quality of reference mappings is ensured by human curation, ontology pruning, etc.; and a comprehensive evaluation framework is proposed to measure OM performance from various perspectives for both ML-based and non-ML-based OM systems. We report evaluation results for OM systems of different types to demonstrate the usage of these resources, all of which are publicly available as part of the new BioML track at OAEI 2022.
翻译:肿瘤匹配(OM)在许多领域发挥着重要作用,如生物信息学和语义网络,其研究越来越受欢迎,特别是机器学习技术的应用。虽然本体学协调评价倡议代表着对OM系统进行系统评估的令人印象深刻的努力,但它仍然受到若干限制,包括对次假设绘图的有限评价、次优参考绘图以及对基于ML系统的评估的支持有限。为了克服这些限制,我们引入了五项新的生物医学OM任务,涉及从Mondo和UMLS提取的本体学。每项任务包括等同和子包采匹配;参考绘图的质量通过人类校正、本体标定等得到保证;建议一个综合评价框架,从不同角度衡量基于ML和非基于ML的OM系统的业绩。我们报告不同类型OM系统的评价结果,以证明这些资源的使用,所有这些都公开作为OAEI 2022新的生物ML轨道的一部分。