In this paper, we introduce a novel unsupervised, graph-based filter feature selection technique which exploits the power of topologically constrained network representations. We model dependency structures among features using a family of chordal graphs (the Triangulated Maximally Filtered Graph), and we maximise the likelihood of features' relevance by studying their relative position inside the network. Such an approach presents three aspects that are particularly satisfactory compared to its alternatives: (i) it is highly tunable and easily adaptable to the nature of input data; (ii) it is fully explainable, maintaining, at the same time, a remarkable level of simplicity; (iii) it is computationally cheaper compared to its alternatives. We test our algorithm on 16 benchmark datasets from different applicative domains showing that it outperforms or matches the current state-of-the-art under heterogeneous evaluation conditions.
翻译:在本文中,我们介绍了一种新颖的无监督、基于图过滤的特征选择技术,利用了拓扑约束网络表示的优势。我们使用一个家族的弦图(三角形最大过滤图)对特征之间的依赖关系进行建模,并通过研究它们在网络中的相对位置来最大化特征相关性的可能性。这种方法具有三个特点,特别是与其替代方案相比:(i)它非常可调,并且易于根据输入数据的性质进行适应;(ii)它是完全可解释的,同时保持了显著的简单性;(iii)它在计算上比其替代方案更便宜。我们在来自不同应用领域的16个基准数据集上测试了我们的算法,显示出在异构评估条件下,它优于或与当前的技术水平相匹配。