Retail product Image classification problems are often few shot classification problems, given retail product classes cannot have the type of variations across images like a cat or dog or tree could have. Previous works have shown different methods to finetune Convolutional Neural Networks to achieve better classification accuracy on such datasets. In this work, we try to address the problem statement : Can we pretrain a Convolutional Neural Network backbone which yields good enough representations for retail product images, so that training a simple logistic regression on these representations gives us good classifiers ? We use contrastive learning and pseudolabel based noisy student training to learn representations that get accuracy in order of finetuning the entire Convnet backbone for retail product image classification.
翻译:零售产品图像分类问题往往很少是零星分类问题,因为零售产品类别不可能有像猫、狗或树那样的图象的变异类型。 先前的工程已经展示了对进化神经网络进行微调的不同方法,以提高这类数据集的分类准确性。 在这项工作中,我们试图解决问题声明:我们能否预设进化进化神经网络主干线,为零售产品图像提供足够多的体现,以便对这些图象进行简单的后勤回归培训,使我们有良好的分类者? 我们使用对比式学习和假贴标签的吵闹学生培训,以学习能够准确调整整个Convnet骨干以达到零售产品图像分类的准确性。