This paper focuses on a core task in computational sustainability and statistical ecology: species distribution modeling (SDM). In SDM, the occurrence pattern of a species on a landscape is predicted by environmental features based on observations at a set of locations. At first, SDM may appear to be a binary classification problem, and one might be inclined to employ classic tools (e.g., logistic regression, support vector machines, neural networks) to tackle it. However, wildlife surveys introduce structured noise (especially under-counting) in the species observations. If unaccounted for, these observation errors systematically bias SDMs. To address the unique challenges of SDM, this paper proposes a framework called StatEcoNet. Specifically, this work employs a graphical generative model in statistical ecology to serve as the skeleton of the proposed computational framework and carefully integrates neural networks under the framework. The advantages of StatEcoNet over related approaches are demonstrated on simulated datasets as well as bird species data. Since SDMs are critical tools for ecological science and natural resource management, StatEcoNet may offer boosted computational and analytical powers to a wide range of applications that have significant social impacts, e.g., the study and conservation of threatened species.
翻译:本文侧重于计算可持续性和统计生态的核心任务:物种分布模型(SDM) 。在SDM中,根据一系列地点的观测结果,环境特征可以预测地貌物种的发生模式。首先,SDM似乎是一个二进制分类问题,人们可能倾向于使用经典工具(如后勤回归、支持病媒机器、神经网络)来解决这一问题。然而,野生动物调查在物种观测中引入了结构化噪音(特别是低量计算),如果这些观测错误没有找到,则系统性偏向SDMs。为了应对SDM的独特挑战,本文提出了一个称为StatEcoNet的框架。具体地说,这项工作在统计生态学中采用一个图形化的基因化模型,作为拟议计算框架的骨架,并仔细整合框架下的神经网络。StatecoNet相对于相关方法的优势在模拟数据集和鸟类物种数据上得到了证明。由于SDMSDM是生态科学和自然资源管理的关键工具,因此StaTEcoNet可能为具有重大社会影响的广泛应用提供强化的计算和分析能力,例如,研究。