Given that measuring food consumption at a population scale is a challenging task, researchers have begun to explore digital traces (e.g., from social media or from food-tracking applications) as potential proxies. However, it remains unclear to what extent digital traces reflect real food consumption. The present study aims to bridge this gap by quantifying the link between dietary behaviors as captured via social media (Twitter) v.s. a food-tracking application (MyFoodRepo). We focus on the case of Switzerland and contrast images of foods collected through the two platforms, by designing and deploying a novel crowdsourcing framework for estimating biases with respect to nutritional properties and appearance. We find that the food type distributions in social media v.s. food tracking diverge; e.g., bread is 2.5 times more frequent among consumed and tracked foods than on Twitter, whereas cake is 12 times more frequent on Twitter. Controlling for the different food type distributions, we contrast consumed and tracked foods of a given type with foods shared on Twitter. Across food types, food posted on Twitter is perceived as tastier, more caloric, less healthy, less likely to have been consumed at home, more complex, and larger-portioned, compared to consumed and tracked foods. The fact that there is a divergence between food consumption as measured via the two platforms implies that at least one of the two is not a faithful representation of the true food consumption in the general Swiss population. Thus, researchers should be attentive and aim to establish evidence of validity before using digital traces as a proxy for the true food consumption of a general population. We conclude by discussing the potential sources of these biases and their implications, outlining pitfalls and threats to validity, and proposing actionable ways for overcoming them.
翻译:鉴于衡量人口规模的粮食消费是一项具有挑战性的任务,研究人员已开始将数字痕迹(如社交媒体或食品跟踪应用)作为潜在的替代物来探索数字痕迹(如社会媒体或食品跟踪应用),然而,对于数字痕迹反映实际粮食消费的程度,仍不清楚。本研究的目的是通过量化社交媒体(Twitter)与食品跟踪应用(MyFoodRepo)所捕捉的饮食行为之间的联系来弥合这一差距。我们侧重于瑞士的情况,对比通过两个平台收集的食品图像,设计和部署新的众包框架,以估计营养特性和外观方面的偏差。我们发现,社会媒体与食品跟踪不同,食物类型分布在多少程度上是数字,食物分布比通过社交媒体(Twitter)多2.5倍,而蛋糕在推特上则更频繁12倍。控制不同食品类型分布,我们将消费和跟踪某类食物的流向与在Twitter上分享的食品。所有食物类型,在Twitter上张贴的食品被看成是塔丝、更卡路里、更准确、更不准确、更不健康、更接近于食物消费的渠道之间,因此更难测测测测算的食品来源。