In this paper, we report our participation in the Task 2: Triple Scoring of WSDM Cup challenge 2017. In this task, we were provided with triples of "type-like" relations which were given human-annotated relevance scores ranging from 0 to 7, with 7 being the "most relevant" and 0 being the "least relevant". The task focuses on two such relations: profession and nationality. We built a system which could automatically predict the relevance scores for unseen triples. Our model is primarily a supervised machine learning based one in which we use well-designed features which are used to a make a Logistic Ordinal Regression based classification model. The proposed system achieves an overall accuracy score of 0.73 and Kendall's tau score of 0.36.
翻译:在本文中,我们报告了我们参加任务2:WSDM杯挑战2017的三重分数。在这项任务中,我们得到了三重“类型式”关系,给予其人文附加说明的相关性分数从0到7不等,7分是“最相关”的分数,0分是“最相关”的分数。任务集中在两种关系上:职业和国籍。我们建立了一个系统,可以自动预测不可见三重的分数。我们的模型主要是一个监督的机器学习,其基础是我们使用设计完善的特征,用来制作一个物流半反向性分类模型。提议的系统总精度分数为0.73,Kendall的评分为0.36。