Identification of fossil species is crucial to evolutionary studies. Recent advances from deep learning have shown promising prospects in fossil image identification. However, the quantity and quality of labeled fossil images are often limited due to fossil preservation, conditioned sampling, and expensive and inconsistent label annotation by domain experts, which pose great challenges to the training of deep learning based image classification models. To address these challenges, we follow the idea of the wisdom of crowds and propose a novel multiview ensemble framework, which collects multiple views of each fossil specimen image reflecting its different characteristics to train multiple base deep learning models and then makes final decisions via soft voting. We further develop OGS method that integrates original, gray, and skeleton views under this framework to demonstrate the effectiveness. Experimental results on the fusulinid fossil dataset over five deep learning based milestone models show that OGS using three base models consistently outperforms the baseline using a single base model, and the ablation study verifies the usefulness of each selected view. Besides, OGS obtains the superior or comparable performance compared to the method under well-known bagging framework. Moreover, as the available training data decreases, the proposed framework achieves more performance gains compared to the baseline. Furthermore, a consistency test with two human experts shows that OGS obtains the highest agreement with both the labels of dataset and the two experts. Notably, this methodology is designed for general fossil identification and it is expected to see applications on other fossil datasets. The results suggest the potential application when the quantity and quality of labeled data are particularly restricted, e.g., to identify rare fossil images.
翻译:化石物种的识别是进化研究的关键。最近深层学习的进展显示,在化石图像的识别方面前景大有希望。然而,标签化石图像的数量和质量往往有限,原因是域专家保存化石、进行有条件的取样、以及昂贵和前后不一致的标签说明,这对深层学习的图像分类模型的培训构成巨大挑战。为了应对这些挑战,我们遵循人群智慧的概念,并提议一个新颖的多视图混合框架,其中收集每个化石标本的多种观点,反映其不同特点,以训练多种基础深层学习模型,然后通过软投票作出最后决定。我们进一步开发OGS方法,将原始的、灰色的和骨架观点结合起来,以展示其有效性。在五个深层基于里程碑的模型中,关于氟化石矿物数据集的实验结果显示,使用三个基模型始终超越基线,使用一个单一基模型,而一个混合研究证实每个选定视图的有用性。此外,OGS获得优或可比的绩效,然后通过熟悉的包装框架下的方法作出最后决定。此外,由于现有的培训数据应用结果显示,Oslind Stal 框架的预期数据将比专家获得更高的数据。