Integrating heterogeneous data sources and expert knowledge is essential for overcoming data scarcity and enhancing estimation accuracy. Two main frameworks naturally arise to perform the integration of these multiple sources: sequential Bayesian inference and integrated models. The first one consists of updating posterior information in a sequential data analysis procedure, without the need to reanalyze previous data when new data become available. The second one consists of bringing together diverse sources of information in a joint inferential analysis through hierarchical Bayesian models. Within the context of the first framework, we propose a recursive inference method grounded in the methodological principles of INLA, designed to handle spatial and spatio-temporal problems, although its applicability is not limited to these cases, as the procedure is general in nature. Within the integrated models framework, we also present a comprehensive approach to address change of support issues that arise when combining heterogeneous information sources, developing a typology that classifies such changes as spatial, temporal, spatio-temporal, or categorical. Both frameworks can be combined, as there is neither a theoretical nor a practical incompatibility preventing their joint use. Finally, detailed examples are provided to illustrate clear and replicable procedures for combining heterogeneous data sources with change of support and recursive inference.
翻译:整合异构数据源与专家知识对于克服数据稀缺性及提升估计精度至关重要。实现多源信息融合主要存在两种框架:序贯贝叶斯推断与集成模型。前者通过序贯数据分析过程更新后验信息,无需在新数据可用时重新分析既往数据;后者则通过层次贝叶斯模型将多元信息源纳入联合推断分析。在序贯框架下,我们提出一种基于INLA方法学原理的递归推断方法,该方法专为处理空间与时空问题设计,但其适用性不限于此,因其本质具有普适性。在集成模型框架内,我们进一步提出系统性解决方案以应对异质信息源整合中出现的支撑域变更问题,构建了将此类变更归类为空间、时间、时空或范畴性的类型学体系。两种框架可协同使用,因其在理论与实践中均不存在互斥性。最后,本文通过详实案例阐明结合支撑域变更与递归推断的异构数据融合流程,确保方法的清晰性与可复现性。