透明环境微生物图象:从进进神经网络到视觉变异器的深学习方法补齐分类比较 (A Comparison for Patch-level Classification of Deep Learning Methods on Transparent Environmental Microorganism Images: from Convolutional Neural Networks to Visual Transformers)

Neural Networks · Networking · Performer · 回合 · 卷积神经网络 ·

2021 年 7 月 21 日

A Comparison for Patch-level Classification of Deep Learning Methods on Transparent Environmental Microorganism Images: from Convolutional Neural Networks to Visual Transformers

翻译：透明环境微生物图象:从进进神经网络到视觉变异器的深学习方法补齐分类比较

Hechen Yang,Chen Li,Jinghua Zhang,Peng Zhao,Ao Chen,Xin Zhao,Tao Jiang,Marcin Grzegorzek

Nowadays, analysis of Transparent Environmental Microorganism Images (T-EM images) in the field of computer vision has gradually become a new and interesting spot. This paper compares different deep learning classification performance for the problem that T-EM images are challenging to analyze. We crop the T-EM images into 8 * 8 and 224 * 224 pixel patches in the same proportion and then divide the two different pixel patches into foreground and background according to ground truth. We also use four convolutional neural networks and a novel ViT network model to compare the foreground and background classification experiments. We conclude that ViT performs the worst in classifying 8 * 8 pixel patches, but it outperforms most convolutional neural networks in classifying 224 * 224 pixel patches.

翻译：目前,计算机视觉领域透明环境微生物图像分析(T-EM图像)逐渐成为一个有趣的新点。本文比较了T-EM图像分析困难的问题的不同深层次学习分类性能。我们将T-EM图像切成8 * 8 和224 * 224 像素补丁, 比例相同, 然后根据地面真相将两个不同的像素补丁分割成前景和背景。我们还使用四个神经神经网络和一个新颖的VIT网络模型来比较地表和背景分类实验。我们的结论是, VIT在对8 * 8 像素补丁进行分类方面表现最差, 但它在对224 * 224 像素补丁进行分类时, 超越了大多数革命性神经网络。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/