The popularity of Machine learning (ML), Deep learning (DL), and Artificial intelligence (AI) has sharply risen in recent years. Despite their spike in popularity, the inner workings of ML and DL algorithms are perceived as opaque, and their relationship to classical data analysis tools remains debated. It is often assumed that ML and DL excel primarily at making predictions. Recently, however, they have been increasingly used for classical analytical tasks traditionally covered by statistical models. Moreover, recent reviews on ML have focused exclusively on DL, missing out on synthesizing the wealth of ML algorithms with different advantages and general principles. Here, we provide a comprehensive overview of the field of ML and DL, starting with its historical developments, the existing algorithm families, their differences from traditional statistical tools, and universal ML principles. We then discuss why and when ML and DL models excel at prediction tasks and where they could offer alternatives to traditional statistical methods for inference, highlighting current and emerging applications for ecological problems. Finally, we summarize emerging trends such as scientific and causal ML, explainable AI, and responsible AI that may significantly impact ecological data analysis in the future.
翻译:近年来,机器学习(ML)、深层学习(DL)和人工智能(AI)的普及程度急剧提高。尽管其受欢迎程度高涨,但ML和DL算法的内部运作被认为不透明,而且它们与古典数据分析工具的关系仍然争论不休。人们常常认为ML和DL主要在预测方面有所成就。然而,最近,它们越来越多地被用于传统的统计模型所覆盖的典型分析任务。此外,最近对ML的审查专门侧重于DL, 遗漏了对具有不同优势和一般原则的ML算法财富的合成。在这里,我们全面概述了ML和DL领域的情况,从历史发展、现有算法系、它们与传统统计工具的差异以及通用 ML原则开始。然后我们讨论ML和DL模型为什么和何时在预测任务中取得成就,以及何时可以提供替代传统统计方法的替代方法,用于推断,突出当前和新出现的生态问题应用。我们总结了新出现的趋势,如科学和因果关系ML、可解释的AI和负责任的AI等,以及可能对未来的生态数据分析产生重大影响。