Under ideal conditions, the probability density function (PDF) of a random variable, such as a sensor measurement, would be well known and amenable to computation and communication tasks. However, this is often not the case, so the user looks for some other PDF that approximates the true but intractable PDF. Conservativeness is a commonly sought property of this approximating PDF, especially in distributed or unstructured data systems where the data being fused may contain un-known correlations. Roughly, a conservative approximation is one that overestimates the uncertainty of a system. While prior work has introduced some definitions of conservativeness, these definitions either apply only to normal distributions or violate some of the intuitive appeal of (Gaussian) conservative definitions. This work provides a general and intuitive definition of conservativeness that is applicable to any probability distribution that is a measure over $\mathbb{R}^m$ or an infinite subset thereof, including multi-modal and uniform distributions. Unfortunately, we show that this \emph{strong} definition of conservative cannot be used to evaluate data fusion techniques. Therefore, we also describe a weaker definition of conservative and show it is preserved through common data fusion methods such as the linear and log-linear opinion pool, and homogeneous functionals, \rev{assuming the input distributions can be factored into independent and common distributions}. In addition, we show that after fusion, weak conservativeness is preserved by Bayesian updates. These strong and weak definitions of conservativeness can help design and evaluate potential correlation-agnostic data fusion techniques.
翻译:在理想条件下,随机变量(如传感器测量)的概率密度函数(PDF)将广为人知,并易于计算和通信任务。然而,这种情况往往不是如此,因此用户寻找一些其他接近真实但棘手的 PDF 的PDF。保守性是这种近似PDF的常见属性,特别是在分布式或非结构化数据结合数据可能包含未知关联的数据的分布或非结构化数据系统中。粗略地,保守近似是高估系统不确定性的近似值。虽然先前的工作引入了一些保守性定义,但这些定义要么只适用于正常的分布,要么违反(加西安)保守性定义的一些直观性吸引力。因此,这项工作提供了一种一般和直观性保守性定义,适用于任何概率分布的尺度,包括多模式和统一的分布。不幸的是,我们显示这种保守性定义不能用于评估数据流的稳定性,因此,我们通过常规性设计技术来显示一个较弱的稳定性定义。我们通过常规性数据分配和直线性统计方法来显示一个较弱的稳定性定义。