Molecular and morphological characters, as important parts of biological taxonomy, are contradictory but need to be integrated. Organism's image recognition and bioinformatics are emerging and hot problems nowadays but with a gap between them. In this work, a multi-branching recognition framework mediated by genetic information bridges this barrier, which establishes the link between macro-morphology and micro-molecular information of mushrooms. The novel multi-perspective structure is proposed to fuse the feature images from three branching models, which significantly improves the accuracy of recognition by about 10% and up to more than 90%. Further, genetic information is implemented to the mushroom image recognition task by using genetic distance embeddings as the representation space for predicting image distance and species identification. Semantic overfitting of traditional classification tasks and the granularity of fine-grained image recognition are also discussed in depth for the first time. The generalizability of the model was investigated in fine-grained scenarios using zero-shot learning tasks, which could predict the taxonomic and evolutionary information of unseen samples. We presented the first method to map images to DNA, namely used an encoder mapping image to genetic distances, and then decoded DNA through a pre-trained decoder, where the total test accuracy on 37 species for DNA prediction is 87.45%. This study creates a novel recognition framework by systematically studying the mushroom image recognition problem, bridging the gap between macroscopic biological information and microscopic molecular information, which will provide a new reference for intelligent biometrics in the future.
翻译:分子和形态特征是生物分类学的重要部分,是相互矛盾的,但需要整合。生物生物学的图像识别和生物信息学正在出现,但目前存在热点问题。在这项工作中,利用基因远程嵌入作为预测图像距离和物种识别的表示空间,以基因信息为介质的多处识别框架,将这一屏障连接起来,从而在蘑菇的宏观形态学和微分子信息之间建立了联系。提出了新颖的多视角结构,将三个分支模型的特征图像结合起来,这大大提高了大约10%的准确度,达到90%以上。此外,基因信息在蘑菇图像识别工作中得到了实施,使用基因远程嵌入作为预测图像距离和物种识别的表示空间。还首次深入讨论了传统分类任务和微细微放大图像识别的颗粒性。在精细的假设中,利用零光学学习任务对模型的一般可测量,从而可以预测新样本的分类和进化信息的准确度。我们介绍了用于绘制图像的地图的首种方法,即通过DNA测试前的DNA,通过测试的DNA来进行测测测测测测测测测前的DNA。