When aggregating information from conflicting sources, one's goal is to find the truth. Most real-value \emph{truth discovery} (TD) algorithms try to achieve this goal by estimating the competence of each source and then aggregating the conflicting information by weighing each source's answer proportionally to her competence. However, each of those algorithms requires more than a single source for such estimation and usually does not consider different estimation methods other than a weighted mean. Therefore, in this work we formulate, prove, and empirically test the conditions for an Empirical Bayes Estimator (EBE) to dominate the weighted mean aggregation. Our main result demonstrates that EBE, under mild conditions, can be used as a second step of any TD algorithm in order to reduce the expected error.
翻译:当汇集来自相互矛盾来源的信息时,我们的目标是找出真相。大多数真实价值的计算法试图实现这一目标,方法是估计每个来源的能力,然后根据每个来源的能力按比例权衡其答复,然后汇总相互矛盾的信息。然而,每一种计算法都需要不止一个来源来进行这种估计,而且通常不考虑除加权平均值以外的不同估计方法。因此,在这项工作中,我们制定、证明并用经验测试了环境实验Bayes Estimator(EBE)控制加权平均值总和的条件。我们的主要结果显示,在温和的条件下,EBE可以作为任何TD算法的第二步,以减少预期的错误。