The BIOSCAN project, led by the International Barcode of Life Consortium, seeks to study changes in biodiversity on a global scale. One component of the project is focused on studying the species interaction and dynamics of all insects. In addition to genetically barcoding insects, over 1.5 million images per year will be collected, each needing taxonomic classification. With the immense volume of incoming images, relying solely on expert taxonomists to label the images would be impossible; however, artificial intelligence and computer vision technology may offer a viable high-throughput solution. Additional tasks including manually weighing individual insects to determine biomass, remain tedious and costly. Here again, computer vision may offer an efficient and compelling alternative. While the use of computer vision methods is appealing for addressing these problems, significant challenges resulting from biological factors present themselves. These challenges are formulated in the context of machine learning in this paper.
翻译:由生命国际条码联合会牵头的BIOSCAN项目寻求研究全球范围的生物多样性变化,该项目的一个部分侧重于研究所有昆虫的物种相互作用和动态。除了基因条码昆虫之外,每年还将收集150多万张图像,每个图像都需要分类分类。随着大量图像的流入,完全依靠专家分类学家来标注图像是不可能的;然而,人工智能和计算机视觉技术可能提供一个可行的高通量解决方案。额外的任务包括人工称重个别昆虫以确定生物量,仍然乏味和昂贵。在这里,计算机视觉可能提供高效和令人信服的替代方法。虽然计算机视觉方法的使用对于解决这些问题很有吸引力,但生物因素本身也带来了重大挑战。这些挑战是在本文件中机器学习的背景下提出的。