This document gives a concise outline of some of the common mistakes that occur when using machine learning techniques, and what can be done to avoid them. It is intended primarily as a guide for research students, and focuses on issues that are of particular concern within academic research, such as the need to do rigorous comparisons and reach valid conclusions. It covers five stages of the machine learning process: what to do before model building, how to reliably build models, how to robustly evaluate models, how to compare models fairly, and how to report results.
翻译:本文件简要概述了在使用机器学习技术时发生的一些常见错误,以及可以做些什么来避免这些错误。它主要旨在作为研究学生的指南,侧重于学术研究中特别关注的问题,如进行严格比较和得出有效结论的必要性。它涵盖了机器学习过程的五个阶段:建模前应做什么,如何可靠地建立模型,如何强有力地评估模型,如何公平比较模型,如何报告结果。