77% of adults over 50 want to age in place today, presenting a major challenge to ensuring adequate nutritional intake. It has been reported that one in four older adults that are 65 years or older are malnourished and given the direct link between malnutrition and decreased quality of life, there have been numerous studies conducted on how to efficiently track nutritional intake of food. Recent advancements in machine learning and computer vision show promise of automated nutrition tracking methods of food, but require a large high-quality dataset in order to accurately identify the nutrients from the food on the plate. Unlike existing datasets, a collection of 3D models with nutritional information allow for view synthesis to create an infinite number of 2D images for any given viewpoint/camera angle along with the associated nutritional information. In this paper, we develop a methodology for collecting high-quality 3D models for food items with a particular focus on speed and consistency, and introduce NutritionVerse-3D, a large-scale high-quality high-resolution dataset of 105 3D food models, in conjunction with their associated weight, food name, and nutritional value. These models allow for large quantity food intake scenes, diverse and customizable scene layout, and an infinite number of camera settings and lighting conditions. NutritionVerse-3D is publicly available as a part of an open initiative to accelerate machine learning for nutrition sensing.
翻译:今天超过50岁的成年人中有77%希望能够在家中养老,这给确保充足营养摄入带来了巨大的挑战。据报道,65岁或以上的老年人中有四分之一营养不良,饮食营养与生活质量的直接关系表明有许多研究致力于如何高效地跟踪饮食营养摄入。最近机器学习和计算机视觉的进展显示了自动化食品营养追踪方法的潜力,但需要一个大型高质量的数据集才能准确识别盘子里的营养成分。与现有数据集不同,一组具有营养信息的三维模型可进行视图合成,为任意视点/摄像机角度创建无限数量的2D图像以及相关的营养信息。在本文中,我们开发了一种采集高质量3D食品模型的方法,特别关注速度和一致性,并推出了NutritionVerse-3D,一个包括105个高质量、高分辨率3D食品模型及其关联的重量、食品名称和营养价值的大规模数据集。这些模型允许大量食物摄入场景、多样化和可定制的场景布局以及无限数量的相机设置和照明条件。作为加速营养感应机器学习的开放倡议的一部分,NutritionVerse-3D是公开可用的。