Evolution of visual object recognition architectures based on Convolutional Neural Networks & Convolutional Deep Belief Networks paradigms has revolutionized artificial Vision Science. These architectures extract & learn the real world hierarchical visual features utilizing supervised & unsupervised learning approaches respectively. Both the approaches yet cannot scale up realistically to provide recognition for a very large number of objects as high as 10K. We propose a two level hierarchical deep learning architecture inspired by divide & conquer principle that decomposes the large scale recognition architecture into root & leaf level model architectures. Each of the root & leaf level models is trained exclusively to provide superior results than possible by any 1-level deep learning architecture prevalent today. The proposed architecture classifies objects in two steps. In the first step the root level model classifies the object in a high level category. In the second step, the leaf level recognition model for the recognized high level category is selected among all the leaf models. This leaf level model is presented with the same input object image which classifies it in a specific category. Also we propose a blend of leaf level models trained with either supervised or unsupervised learning approaches. Unsupervised learning is suitable whenever labelled data is scarce for the specific leaf level models. Currently the training of leaf level models is in progress; where we have trained 25 out of the total 47 leaf level models as of now. We have trained the leaf models with the best case top-5 error rate of 3.2% on the validation data set for the particular leaf models. Also we demonstrate that the validation error of the leaf level models saturates towards the above mentioned accuracy as the number of epochs are increased to more than sixty.
翻译:基于 Convolutional Neal 网络和 Convolution 深信仰网络模式的视觉对象识别架构的演进演变,使人工视觉科学革命化。这些架构利用监管和不受监管的学习方法分别提取和学习真实世界等级的视觉特征。这两种方法都无法现实地扩大,为10K级的众多天体提供识别。我们建议了两个等级级的深层次学习架构,根据分解原则,将大比例识别架构分解成根和叶级模型结构。根叶级和叶叶级模型的每一个模型都只受过培训,以提供优于当今盛行的任何1级精度深层学习结构可能取得的结果。提议的架构将对象分为两个步骤。在第一步,根级模型将对象归类为高至10K级。在第二步,所有叶级模型中选择了公认的高层次叶级叶级识别模型。这个叶级模型与将它分类为具体类别和叶级模型的同一输入对象图像。我们还提议将叶级模型混编成一个组合模型,既监督又不及不精度的精度,只要我们所训练的叶级模型的精度学习方法,就能够学习了。