The re-emergence of mosquito-borne diseases (MBDs), which kill hundreds of thousands of people each year, has been attributed to increased human population, migration, and environmental changes. Convolutional neural networks (CNNs) have been used by several studies to recognise mosquitoes in images provided by projects such as Mosquito Alert to assist entomologists in identifying, monitoring, and managing MBD. Nonetheless, utilising CNNs to automatically label input samples could involve incorrect predictions, which may mislead future epidemiological studies. Furthermore, CNNs require large numbers of manually annotated data. In order to address the mentioned issues, this paper proposes using the Monte Carlo Dropout method to estimate the uncertainty scores in order to rank the classified samples to reduce the need for human supervision in recognising Aedes albopictus mosquitoes. The estimated uncertainty was also used in an active learning framework, where just a portion of the data from large training sets was manually labelled. The experimental results show that the proposed classification method with rejection outperforms the competing methods by improving overall performance and reducing entomologist annotation workload. We also provide explainable visualisations of the different regions that contribute to a set of samples' uncertainty assessment.
翻译:蚊虫传染疾病(MBDs)每年造成数十万人死亡,其重新出现是由于人类人口、移徙和环境变化的增加; 革命性神经网络(CNNs)被数项研究用来识别蚊子在蚊子警报等项目提供的图像中发现的蚊子,以协助昆虫学家识别、监测和管理MBD。然而,利用CNN为输入样本自动贴标签可能涉及不正确的预测,这可能会误导未来的流行病学研究。此外,CNN需要大量人工附加说明的数据。为了解决上述问题,本文提议使用蒙特卡洛漏出方法来估计不确定性的分数,以便对分类样本进行分级,以减少在识别艾德斯高比目蚊子方面对人体监督的需要。估计不确定性还用于一个积极的学习框架,其中仅对大型培训成套数据的一部分进行了人工标注。实验结果表明,拟议的拒绝分类方法通过改进总体性能和减少昆虫说明工作量,从而超越了相互竞争的方法。我们还提供了不同区域的可预见性评估。我们还提供了不同区域的模型。