This document describes a possible approach that can be used to check the relevance of a summary / definition of an entity with respect to its name. This classifier focuses on the relevancy of an entity's name to its summary / definition, in other words, it is a name relevance check. The percentage score obtained from this approach can be used either on its own or used to supplement scores obtained from other metrics to arrive upon a final classification; at the end of the document, potential improvements have also been outlined. The dataset that this document focuses on achieving an objective score is a list of package names and their respective summaries (sourced from pypi.org).
翻译:本文件介绍了一种可能的办法,可用来检查某一实体的概要/定义与其名称的相关性。该分类器侧重于某一实体的名称与其概要/定义的相关性,换句话说,这是一个名称相关性检查。从这一方法中获得的百分数可以单独使用,也可以用来补充从其他指标中获得的分数,以达成最后分类;在文件末尾,还概述了可能的改进。本文件侧重于实现客观分数的数据集是一套组合名称及其各自摘要的清单(来源于pypi.org)。