Origami is becoming more and more relevant to research. However, there is no public dataset yet available and there hasn't been any research on this topic in machine learning. We constructed an origami dataset using images from the multimedia commons and other databases. It consists of two subsets: one for classification of origami images and the other for difficulty estimation. We obtained 16000 images for classification (half origami, half other objects) and 1509 for difficulty estimation with $3$ different categories (easy: 764, intermediate: 427, complex: 318). The data can be downloaded at: https://github.com/multimedia-berkeley/OriSet. Finally, we provide machine learning baselines.
翻译:但是,目前还没有公共数据集,在机器学习中也没有关于这一主题的任何研究。我们利用多媒体公域和其他数据库的图像建立了一个折纸数据集。它由两个子集组成:一个用于折纸图像分类,另一个用于难度估计。我们获得了16 000张用于分类的图像(半折纸,其他半天体)和1509张用于困难估算的图像,不同类别为3 000美元(方便:764,中间:427,复杂:318美元)。 数据可以在以下网址下载:https://github.com/multimedia-berkeley/OriSet。 最后,我们提供了机器学习基线。