Automatic snake species recognition is important because it has vast potential to help lower deaths and disabilities caused by snakebites. We introduce our solution in SnakeCLEF 2022 for fine-grained snake species recognition on a heavy long-tailed class distribution. First, a network architecture is designed to extract and fuse features from multiple modalities, i.e. photograph from visual modality and geographic locality information from language modality. Then, logit adjustment based methods are studied to relieve the impact caused by the severe class imbalance. Next, a combination of supervised and self-supervised learning method is proposed to make full use of the dataset, including both labeled training data and unlabeled testing data. Finally, post processing strategies, such as multi-scale and multi-crop test-time-augmentation, location filtering and model ensemble, are employed for better performance. With an ensemble of several different models, a private score 82.65%, ranking the 3rd, is achieved on the final leaderboard.
翻译:自动蛇物种的识别很重要,因为它有巨大的潜力帮助减少蛇类动物造成的死亡和残疾。我们在2022年的蛇状CLEF中引入了一种解决方案,用于在长尾长尾类分配中微粒的蛇类物种的识别。首先,设计了一个网络结构,从多种模式中提取和引信特征,即视觉模式的照片和语言模式中的地理地点信息。然后,对基于登录调整的方法进行研究,以减轻严重阶级不平衡的影响。接着,建议采用监督和自我监督的学习方法,以充分利用数据集,包括标签培训数据和无标签测试数据。最后,采用后处理战略,如多尺度和多作物测试时间放大、位置过滤和模型共通性,以更好地发挥作用。在最后的领头板上,将几种不同模式组合在一起,私人评分82.65%,排第三位。