The advent of social media platforms has been a catalyst for the development of digital photography that engendered a boom in vision applications. With this motivation, we introduce a large-scale dataset termed 'Photozilla', which includes over 990k images belonging to 10 different photographic styles. The dataset is then used to train 3 classification models to automatically classify the images into the relevant style which resulted in an accuracy of ~96%. With the rapid evolution of digital photography, we have seen new types of photography styles emerging at an exponential rate. On that account, we present a novel Siamese-based network that uses the trained classification models as the base architecture to adapt and classify unseen styles with only 25 training samples. We report an accuracy of over 68% for identifying 10 other distinct types of photography styles. This dataset can be found at https://trisha025.github.io/Photozilla/
翻译:社交媒体平台的出现是数字摄影发展的催化剂,它催生了视觉应用的蓬勃发展。有了这个动机,我们引入了一个名为“Photozilla”的大规模数据集,其中包括属于10种不同摄影风格的990千多张图像。然后,数据集用于培训3个分类模型,将图像自动分类为相关风格,从而导致~96%的准确率。随着数字摄影的迅速演变,我们看到新型摄影风格以指数速速率出现。从这个角度讲,我们展示了一个以亚马西语为基础的新颖网络,利用经过培训的分类模型作为基础结构,对仅25个培训样本的无形风格进行适应和分类。我们报告,68%的准确率用于识别另外10种不同类型的摄影风格。该数据集可在https://trisha5.02github.io/Photozilla/上找到。