Millions of stray animals suffer on the streets or are euthanized in shelters every day around the world. In order to better adopt stray animals, scoring the pawpularity (cuteness) of stray animals is very important, but evaluating the pawpularity of animals is a very labor-intensive thing. Consequently, there has been an urgent surge of interest to develop an algorithm that scores pawpularity of animals. However, the dataset in Kaggle not only has images, but also metadata describing images. Most methods basically focus on the most advanced image regression methods in recent years, but there is no good method to deal with the metadata of images. To address the above challenges, the paper proposes an image regression model called PETS-SWINF that considers metadata of the images. Our results based on a dataset of Kaggle competition, "PetFinder.my", show that PETS-SWINF has an advantage over only based images models. Our results shows that the RMSE loss of the proposed model on the test dataset is 17.71876 but 17.76449 without metadata. The advantage of the proposed method is that PETS-SWINF can consider both low-order and high-order features of metadata, and adaptively adjust the weights of the image model and the metadata model. The performance is promising as our leadboard score is ranked 15 out of 3545 teams (Gold medal) currently for 2021 Kaggle competition on the challenge "PetFinder.my".
翻译:世界上每天,为了更好地采纳流浪动物,对流浪动物的爪牙(精度)的评分非常重要,但评估动物爪牙的评分是一个非常劳力密集型的事情。因此,人们急切地感兴趣地开发一种算法来分分分动物的爪子。然而,卡格格勒的数据集不仅有图像,而且还有描述图像的元数据。大多数方法基本上侧重于近年来最先进的图像回归方法,但是没有处理图像元数据的好方法。为了应对上述挑战,本文提出了一个名为PETS-SWINF的图像回归模型,该模型将考虑图像的元数据。我们基于卡格勒竞争数据集的结果表明,PETS-SWINF比仅基于图像模型的模型有优势。我们的结果显示,测试数据集的拟议模型损失了17.71876,但17.76449没有元数据。为了应对上述挑战,拟议方法的优点是:“PETS-S-SWINF”的图象回归模型模型模型是:“目前PETS-S-SWINF的排名第15级模型和排名前列的模型,可以考虑“15级模型中的低级”的进度。