Spatial models of preference, in the form of vector embeddings, are learned by many deep learning and multiagent systems, including recommender systems. Often these models are assumed to approximate a Euclidean structure, where an individual prefers alternatives positioned closer to their "ideal point", as measured by the Euclidean metric. However, Bogomolnaia and Laslier (2007) showed that there exist ordinal preference profiles that cannot be represented with this structure if the Euclidean space has two fewer dimensions than there are individuals or alternatives. We extend this result, showing that there are realistic situations in which almost all preference profiles cannot be represented with the Euclidean model, and derive a theoretical lower bound on the expected error when using the Euclidean model to approximate non-Euclidean preference profiles. Our results have implications for the interpretation and use of vector embeddings, because in some cases close approximation of arbitrary, true ordinal relationships can be expected only if the dimensionality of the embeddings is a substantial fraction of the number of entities represented.
翻译:许多深层学习和多试剂系统,包括建议系统,都学习了以矢量嵌入为形式的空间偏好模型,这些模型往往被假定为接近欧几里得结构,其中个人更喜欢以欧几里得度度测量的“理想点”为“理想点”的替代品。然而,Bogomolnaia和Lasierer(2007年)表明,如果欧几里德空间比个人或替代品少两个维度,则这种结构中无法代表的偏爱特征特征特征。我们扩展了这一结果,表明存在现实的情况,即几乎所有的偏爱特征都无法与欧几里德模式相代表,而且当使用欧几里德模式来接近非欧几里得偏爱特征时,对预期的错误的理论约束程度较低。我们的结果对矢量嵌入的解释和使用具有影响,因为在某些情况下,只有嵌入的维度占所代表实体数目的相当一部分,才能指望这种任意、真实或丁关系接近。