We introduce a novel dataset for architectural style classification, consisting of 9,485 images of church buildings. Both images and style labels were sourced from Wikipedia. The dataset can serve as a benchmark for various research fields, as it combines numerous real-world challenges: fine-grained distinctions between classes based on subtle visual features, a comparatively small sample size, a highly imbalanced class distribution, a high variance of viewpoints, and a hierarchical organization of labels, where only some images are labeled at the most precise level. In addition, we provide 631 bounding box annotations of characteristic visual features for 139 churches from four major categories. These annotations can, for example, be useful for research on fine-grained classification, where additional expert knowledge about distinctive object parts is often available. Images and annotations are available at: https://doi.org/10.5281/zenodo.5166987
翻译:我们推出建筑风格分类的新数据集,由9 485个教堂建筑图象组成,图像和风格标签都来源于维基百科。数据集可以作为各种研究领域的基准,因为它结合了无数现实世界的挑战:根据微妙的视觉特征对不同类别进行细微区分,抽样规模较小,等级分布高度不平衡,观点差异很大,标签的等级结构,其中只有某些图像贴上了最精确的标签。此外,我们提供了来自四大类139个教堂的特有视觉特征的631个边框插插插图。例如,这些说明可用于精细分类研究,经常在其中提供关于特殊对象部分的其他专家知识。图象和说明见:https://doi.org/10.5281/zenodo.5166987。