The automatic recognition of food on images has numerous interesting applications, including nutritional tracking in medical cohorts. The problem has received significant research attention, but an ongoing public benchmark to develop open and reproducible algorithms has been missing. Here, we report on the setup of such a benchmark using publicly available food images sourced through the mobile MyFoodRepo app. Through four rounds, the benchmark released the MyFoodRepo-273 dataset constituting 24,119 images and a total of 39,325 segmented polygons categorized in 273 different classes. Models were evaluated on private tests sets from the same platform with 5,000 images and 7,865 annotations in the final round. Top-performing models on the 273 food categories reached a mean average precision of 0.568 (round 4) and a mean average recall of 0.885 (round 3). We present experimental validation of round 4 results, and discuss implications of the benchmark setup designed to increase the size and diversity of the dataset for future rounds.
翻译:照片上食物的自动识别有许多有趣的应用,包括医疗组群的营养跟踪。这个问题已经受到大量的研究关注,但缺乏开发开放和可复制算法的持续公共基准。在这里,我们报告使用移动的MyFoodRepo 应用软件提供的公开食物图像建立这样一个基准。通过四轮,基准释放了MyFoodRepo-273数据集,其中包括24 119个图像和总共39 325个分为273个不同类别的多边形块。在同一平台的私人测试组上,对模型进行了评价,该模型有5 000个图像和7 865个说明,最后一轮中,273个食品类的顶级模型平均精确度达到0.568(轮4),平均回顾0.885(轮3),我们提出对第4轮结果的试验性验证,并讨论旨在增加今后各轮数据规模和多样性的基准设置的影响。