The geometric median of a tuple of vectors is the vector that minimizes the sum of Euclidean distances to the vectors of the tuple. Classically called the Fermat-Weber problem and applied to facility location, it has become a major component of the robust learning toolbox. It is typically used to aggregate the (processed) inputs of different data providers, whose motivations may diverge, especially in applications like content moderation. Interestingly, as a voting system, the geometric median has well-known desirable properties: it is a provably good average approximation, it is robust to a minority of malicious voters, and it satisfies the "one voter, one unit force" fairness principle. However, what was not known is the extent to which the geometric median is strategyproof. Namely, can a strategic voter significantly gain by misreporting their preferred vector? We prove in this paper that, perhaps surprisingly, the geometric median is not even $\alpha$-strategyproof, where $\alpha$ bounds what a voter can gain by deviating from truthfulness. But we also prove that, in the limit of a large number of voters with i.i.d. preferred vectors, the geometric median is asymptotically $\alpha$-strategyproof. We show how to compute this bound $\alpha$. We then generalize our results to voters who care more about some dimensions. Roughly, we show that, if some dimensions are more polarized and regarded as more important, then the geometric median becomes less strategyproof. Interestingly, we also show how the skewed geometric medians can improve strategyproofness. Nevertheless, if voters care differently about different dimensions, we prove that no skewed geometric median can achieve strategyproofness for all. Overall, our results constitute a coherent set of insights into the extent to which the geometric median is suitable to aggregate high-dimensional disagreements.
翻译:矢量图的几何中位值是最小化 Euclidean 距离与 Tuple 矢量的距离之和的矢量。 典型地称Fermat- Weber 问题, 并适用于设施位置, 它已成为强健学习工具箱的一个主要组成部分。 通常用来汇总不同数据提供者的( 处理的) 输入, 其动机可能不同, 特别是在内容调适等应用程序中。 有趣的是, 几何中位值具有众所周知的可取性: 它是一个可辨别的准确平均近似值, 它对少数恶意选民来说是强大的, 而且它满足了“ 单一选民, 一单位力量” 公平原则。 然而, 尚不知道的是, 几何中位中位中位值中位值是战略的防偏差。 我们的几何中位值中位值中位值中位值甚至不是美元, 我们的正位值中位值中位值是多少。