Foraminifera are single-celled marine organisms that construct shells that remain as fossils in the marine sediments. Classifying and counting these fossils are important in e.g. paleo-oceanographic and -climatological research. However, the identification and counting process has been performed manually since the 1800s and is laborious and time-consuming. In this work, we present a deep learning-based instance segmentation model for classifying, detecting, and segmenting microscopic foraminifera. Our model is based on the Mask R-CNN architecture, using model weight parameters that have learned on the COCO detection dataset. We use a fine-tuning approach to adapt the parameters on a novel object detection dataset of more than 7000 microscopic foraminifera and sediment grains. The model achieves a (COCO-style) average precision of $0.78 \pm 0.00$ on the classification and detection task, and $0.80 \pm 0.00$ on the segmentation task. When the model is evaluated without challenging sediment grain images, the average precision for both tasks increases to $0.84 \pm 0.00$ and $0.86 \pm 0.00$, respectively. Prediction results are analyzed both quantitatively and qualitatively and discussed. Based on our findings we propose several directions for future work, and conclude that our proposed model is an important step towards automating the identification and counting of microscopic foraminifera.
翻译:孔虫子是一种单细胞海洋生物,其构成的贝壳在海洋沉积物中仍为化石。这些化石的分类和计算在古生物学和气候学研究中很重要。然而,自1800年代以来,鉴定和计数过程一直是手工进行的,而且很费力和费时。在这项工作中,我们提出了一个深层次的基于学习的试样分解模型,用于分类、检测和分解显微孔虫。我们的模型以Mask R-CNN结构为基础,使用在COCO探测数据集中学习的模型重量参数。我们使用微调方法来调整关于7000多颗微粒和沉积物颗粒的新天体探测数据集的参数。模型在分类和检测任务方面平均达到0.78 百万 百万 美元 美元 的模型平均精确度,在分解任务方面拟议的0.80 0.00 0.00 美元 。当模型在没有挑战性沉积物图像的情况下进行评估时,我们的任务的平均精确度将提高至0.84 m 和 质量分析结果。