Factorization machine (FM) is a prevalent approach to modeling pairwise (second-order) feature interactions when dealing with high-dimensional sparse data. However, on the one hand, FM fails to capture higher-order feature interactions suffering from combinatorial expansion, on the other hand, taking into account interaction between every pair of features may introduce noise and degrade prediction accuracy. To solve the problems, we propose a novel approach Graph Factorization Machine (GraphFM) by naturally representing features in the graph structure. In particular, a novel mechanism is designed to select the beneficial feature interactions and formulate them as edges between features. Then our proposed model which integrates the interaction function of FM into the feature aggregation strategy of Graph Neural Network (GNN), can model arbitrary-order feature interactions on the graph-structured features by stacking layers. Experimental results on several real-world datasets has demonstrated the rationality and effectiveness of our proposed approach.
翻译:集成机(FM)是处理高维分散数据时对称(第二阶)特征互动的一种普遍模式,但是,一方面,调频未能捕捉到因组合扩展而导致的较高阶特征互动,另一方面,考虑到每对特征之间的相互作用,可能会带来噪音和降低预测准确性。为了解决问题,我们建议采用新颖的方法,通过自然代表图形结构中的特征来解决问题。特别是,设计了一个新机制,以选择有益的特征互动,并将这些互动作为特征之间的边缘。然后,我们提出的模式,将调频的互动功能纳入图形神经网络(GNN)的特征汇总战略,可以通过堆叠层模拟图形结构特征上的任意顺序特征互动。一些真实世界数据集的实验结果显示了我们拟议方法的合理性和有效性。