While automatic tracking and measuring of our physical activity is a well established domain, not only in research but also in commercial products and every-day life-style, automatic measurement of eating behavior is significantly more limited. Despite the abundance of methods and algorithms that are available in bibliography, commercial solutions are mostly limited to digital logging applications for smart-phones. One factor that limits the adoption of such solutions is that they usually require specialized hardware or sensors. Based on this, we evaluate the potential for estimating the weight of consumed food (per bite) based only on the audio signal that is captured by commercial ear buds (Samsung Galaxy Buds). Specifically, we examine a combination of features (both audio and non-audio features) and trainable estimators (linear regression, support vector regression, and neural-network based estimators) and evaluate on an in-house dataset of 8 participants and 4 food types. Results indicate good potential for this approach: our best results yield mean absolute error of less than 1 g for 3 out of 4 food types when training food-specific models, and 2.1 g when training on all food types together, both of which improve over an existing literature approach.
翻译:尽管自动跟踪和测量我们的体育活动是一个已经确立的领域,不仅在研究方面,而且在商业产品和每天的生活方式方面,自动跟踪和测量我们的身体活动是一个已经确立的领域,但是对饮食行为的自动计量却非常有限。尽管书目中有大量的方法和算法,商业解决方案大多局限于智能手机的数字记录应用程序。限制采用这种解决方案的一个因素是,它们通常需要专门的硬件或传感器。基于这一点,我们评估了仅仅根据商业耳蕾(Samsung Galaxy Buds)所捕捉的音频信号估计消费食品(每咬一口)重量的可能性。具体地说,我们考察了各种特征(音频和非音频特点)和可受训天线回归器(线性回归、支持矢量回归和基于天线-网络的测量器)的组合,并评估了由8名参与者和4种食物组成的内部数据集。结果显示,这一方法的好潜力:我们的最佳结果意味着在培训食品特定模型时,4种食物类型中的3类中有3克的绝对误差,以及在所有种类的训练中,每类食物中,每类方法都改进了2.1克。