猜测Karma:评估社会评级制度的游戏 (GuessTheKarma: A Game to Assess Social Rating Systems)

Popularity systems, like Twitter retweets, Reddit upvotes, and Pinterest pins have the potential to guide people toward posts that others liked. That, however, creates a feedback loop that reduces their informativeness: items marked as more popular get more attention, so that additional upvotes and retweets may simply reflect the increased attention and not independent information about the fraction of people that like the items. How much information remains? For example, how confident can we be that more people prefer item A to item B if item A had hundreds of upvotes on Reddit and item B had only a few? We investigate using an Internet game called GuessTheKarma that collects independent preference judgments (N=20,674) for 400 pairs of images, approximately 50 per pair. Unlike the rating systems that dominate social media services, GuessTheKarma is devoid of social and ranking effects that influence ratings. Overall, Reddit scores were not very good predictors of the true population preferences for items as measured by GuessTheKarma: the image with higher score was preferred by a majority of independent raters only 68% of the time. However, when one image had a low score and the other was one of the highest scoring in its subreddit, the higher scoring image was preferred nearly 90% of the time by the majority of independent raters. Similarly, Imgur view counts for the images were poor predictors except when there were orders of magnitude differences between the pairs. We conclude that popularity systems marked by feedback loops may convey a strong signal about population preferences, but only when comparing items that received vastly different popularity scores.

翻译：广度系统,如Twitter retweets、 Reddit upvotes 和 Pinternational Pentes 等信息系统, 有可能引导人们找到其他人喜欢的海报。然而, 这创造了一个反馈循环, 降低了他们的信息量: 被标记为更受欢迎的项目得到更多关注, 因而更多的高音和retweets可能只是反映对喜欢项目的人的分数的更多关注而不是独立信息。信息还有多少? 比如, 如果A项在Redddit上有数百个高音, B项只有几个? 我们使用一个名为GuessTheKarma的互联网游戏来调查, 收集400对图像的独立的偏好判断( N=20 674 ), 从而降低他们的信息量: 与主导社会媒体服务的评级系统不同, GuessTheKarma 缺乏影响评级的社会和排名效应。总体来说, REddit 分数并不很好地预测那里对项目的真正人口偏好, 而B项的评分只有大多数独立评分只有68 % 。然而, 当一个图像的评分为最低的评分时, 而其他的评分为最低的评分时,