Data representation is often of the natural form with their attribute values. To utilize the data efficiently, one needs to well understand the observed attribute values and identify the potential useful information in the data/smaples, or training data. In this paper, a new data representation, named as hyper-classes representation, is proposed for improving recommendation. At first, the cross entropy, KL divergence and JS divergence of features in data are defined. And then, the hyper-classes in data can be discovered with these three parameters. Finally, a kind of recommendation algorithm is used to evaluate the proposed hyper-class representation of data, and shows that the hyper-class representation is able to provide truly useful reference information for recommendation systems and makes recommendations much better than existing algorithms, i.e., this approach is efficient and promising.
翻译:数据表示形式往往是自然形式及其属性值。 为了有效地利用数据,人们需要很好地理解观察到的属性值,并查明数据/标本或培训数据中的潜在有用信息。在本文中,提议采用称为超等表示的新数据表示形式来改进建议。首先,界定了数据特征的交叉加密、 KL 差异和 JS 差异。然后,用这三个参数可以发现数据中的超等。最后,使用一种建议算法来评价拟议的超等数据表示形式,并表明超等表示能够为建议系统提供真正有用的参考信息,并提出建议比现有的算法要好得多,也就是说,这种方法是有效和有希望的。