Convolutional neural networks have shown successful results in image classification achieving real-time results superior to the human level. However, texture images still pose some challenge to these models due, for example, to the limited availability of data for training in several problems where these images appear, high inter-class similarity, the absence of a global viewpoint of the object represented, and others. In this context, the present paper is focused on improving the accuracy of convolutional neural networks in texture classification. This is done by extracting features from multiple convolutional layers of a pretrained neural network and aggregating such features using Fisher vector. The reason for using features from earlier convolutional layers is obtaining information that is less domain specific. We verify the effectiveness of our method on texture classification of benchmark datasets, as well as on a practical task of Brazilian plant species identification. In both scenarios, Fisher vectors calculated on multiple layers outperform state-of-art methods, confirming that early convolutional layers provide important information about the texture image for classification.
翻译:然而,图象仍对这些模型构成一些挑战,例如,在出现这些图象、高等级相近性、对所代表的物体缺乏全球观点等若干问题的培训数据有限,因此,在图像分类方面,这些图象的实时结果优于人类水平。在这方面,本文件的重点是提高图象分类中富集性神经网络的准确性。这是通过从受过训练的神经网络的多个卷发层提取特征,并利用渔业矢量汇集这些特征来完成的。使用早先的卷发层特征的原因是获取了较不具体的领域信息。我们核实了我们的基准数据集纹理分类方法的有效性,以及巴西植物物种识别的实际任务。在这两种情景中,根据多层超越最新方法计算的渔业矢量,证实早期的图象为分类提供了重要信息。