Food recognition is an important task for a variety of applications, including managing health conditions and assisting visually impaired people. Several food recognition studies have focused on generic types of food or specific cuisines, however, food recognition with respect to Middle Eastern cuisines has remained unexplored. Therefore, in this paper we focus on developing a mobile friendly, Middle Eastern cuisine focused food recognition application for assisted living purposes. In order to enable a low-latency, high-accuracy food classification system, we opted to utilize the Mobilenet-v2 deep learning model. As some of the foods are more popular than the others, the number of samples per class in the used Middle Eastern food dataset is relatively imbalanced. To compensate for this problem, data augmentation methods are applied on the underrepresented classes. Experimental results show that using Mobilenet-v2 architecture for this task is beneficial in terms of both accuracy and the memory usage. With the model achieving 94% accuracy on 23 food classes, the developed mobile application has potential to serve the visually impaired in automatic food recognition via images.
翻译:食品识别是各种应用的重要任务,包括管理卫生条件和协助视力受损者。一些食品识别研究侧重于一般类型的食品或特定烹饪,然而,对于中东菜类的食品识别仍未得到探讨。因此,在本文件中,我们侧重于开发一个移动友好的、中东部菜类重点食品识别应用程序,用于辅助生活目的。为了能够建立一个低弹性、高精度食品分类系统,我们选择使用移动网-V2深层学习模式。由于一些食品比其他食品更受欢迎,使用过的中东食品数据集中的每类样本数量相对不平衡。为了弥补这一问题,在代表性不足的类别中采用了数据增强方法。实验结果表明,使用移动网-V2结构进行这项工作既有利于准确性,也有利于记忆使用。由于模型在23个食品类中实现了94%的准确性,开发的移动应用程序有可能为视觉受损者提供通过图像自动食品识别服务。