Representing data by means of graph structures identifies one of the most valid approach to extract information in several data analysis applications. This is especially true when multimodal datasets are investigated, as records collected by means of diverse sensing strategies are taken into account and explored. Nevertheless, classic graph signal processing is based on a model for information propagation that is configured according to heat diffusion mechanism. This system provides several constraints and assumptions on the data properties that might be not valid for multimodal data analysis, especially when large scale datasets collected from heterogeneous sources are considered, so that the accuracy and robustness of the outcomes might be severely jeopardized. In this paper, we introduce a novel model for graph definition based on fluid diffusion. The proposed approach improves the ability of graph-based data analysis to take into account several issues of modern data analysis in operational scenarios, so to provide a platform for precise, versatile, and efficient understanding of the phenomena underlying the records under exam, and to fully exploit the potential provided by the diversity of the records in obtaining a thorough characterization of the data and their significance. In this work, we focus our attention to using this fluid diffusion model to drive a community detection scheme, i.e., to divide multimodal datasets into many groups according to similarity among nodes in an unsupervised fashion. Experimental results achieved by testing real multimodal datasets in diverse application scenarios show that our method is able to strongly outperform state-of-the-art schemes for community detection in multimodal data analysis.
翻译:以图表结构方式代表数据,可以确定在若干数据分析应用中提取信息的最有效方法之一,在调查多式数据集时尤其如此,因为通过不同遥感战略收集的记录得到考虑和探索;然而,典型的图形信号处理是基于一个信息传播模型,该模型是根据热扩散机制配置的信息传播模型,这个系统为数据属性提供了若干限制和假设,对于多式联运数据分析可能无效,特别是在考虑从不同来源收集的大型数据集时,这样的结果的准确性和稳健性可能受到严重危害。在本文中,我们引入了一个基于流体传播的图表定义新模型。拟议方法提高了基于图形的数据分析能力,以考虑到业务情景中现代数据分析的若干问题,从而为准确、多功能和高效地理解正在测试的记录所根据的现象提供了一个平台,并充分利用记录多样性提供的潜力,对数据及其重要性进行彻底的定性。我们把注意力集中在使用这种流体传播模型,以驱动社区检测计划,即基于流体传播流体的图表分析。 拟议的方法提高了基于图表的数据分析的功能,从而通过模拟模型分析,将我们所实现的模型模型的模型分析结果分为不同的模型,从而将不同形式的模型分解。