The labelling of food products in the EU is regulated by the Food Information of Customers (FIC). Companies are required to provide the corresponding information regarding nutrients and allergens among others. With the rise of e-commerce more and more food products are sold online. There are often errors in the online product descriptions regarding the FIC-relevant information due to low data quality in the vendors' product data base. In this paper we propose a hybrid approach of both rule-based and machine learning to verify nutrient declaration and allergen labelling according to FIC requirements. Special focus is given to the problem of false negatives in allergen prediction since this poses a significant health risk to customers. Results show that a neural net trained on a subset of the ingredients of a product is capable of predicting the allergens contained with a high reliability.
翻译:欧盟食品标签由《客户食品信息》管理。公司必须提供有关营养素和过敏基因的相应信息。随着电子商务的兴起,越来越多的食品在网上销售。由于销售商产品数据库的数据质量低,与欧盟食品产品有关的在线产品说明中经常出现错误。本文建议采用基于规则的学习和机器学习的混合方法,按照食品信息的要求核查营养素申报和过敏标签。特别关注过敏基因预测中的虚假负值问题,因为这对客户的健康构成重大风险。结果显示,根据产品成份的一组成分培训的神经网能够可靠地预测含有的过敏基因。