Recent advances in artificial intelligence promote a wide range of computer vision applications in many different domains. Digital cameras, acting as human eyes, can perceive fundamental object properties, such as shapes and colors, and can be further used for conducting high-level tasks, such as image classification, and object detections. Human perceptions have been widely recognized as the ground truth for training and evaluating computer vision models. However, in some cases, humans can be deceived by what they have seen. Well-functioned human vision relies on stable external lighting while unnatural illumination would influence human perception of essential characteristics of goods. To evaluate the illumination effects on human and computer perceptions, the group presents a novel dataset, the Food Vision Dataset (FVD), to create an evaluation benchmark to quantify illumination effects, and to push forward developments of illumination estimation methods for fair and reliable consumer acceptability prediction from food appearances. FVD consists of 675 images captured under 3 different power and 5 different temperature settings every alternate day for five such days.
翻译:最近人工智能的进步在许多不同领域促进了广泛的计算机视觉应用。数字照相机作为人的眼睛,可以感知基本的物体特性,例如形状和颜色,还可以进一步用于执行高层次的任务,例如图像分类和物体探测。人类的感知被广泛视为培训和评价计算机视觉模型的基本真理。然而,在某些情况下,人类可能受到他们所看到的东西的欺骗。功能良好的人类视觉依靠稳定的外部照明,而非自然的照明会影响人类对货物基本特征的感知。为了评估对人类和计算机感知的光化影响,该摄影机提供了一套新的数据集,即食品视觉数据集(FVD),以建立一个评价基准来量化照明效应,并推进照明估计方法的发展,以便从食品外观中公正可靠的消费者可接受性预测。FVD由675幅图像组成,在3种不同能量下拍摄,5种不同的温度环境下每隔五天每隔一天摄取6天。