In the emerging era of big data, larger available clinical datasets and computational advances have sparked a massive interest in machine learning-based approaches. The number of manuscripts related to machine learning or artificial intelligence has exponentially increased over the past years. As analytical machine learning tools become readily available for clinicians to use, the understanding of key concepts and the awareness of analytical pitfalls are increasingly required for clinicians, investigators, reviewers and editors, who even as experts in their clinical field, sometimes find themselves insufficiently equipped to evaluate machine learning methodologies. In the first section, we provide explanations on the general principles of machine learning, as well as analytical steps required for successful machine learning-based predictive modelling - which is the focus of this series. In further sections, we review the importance of resampling, overfitting and model generalizability as well as feature reduction and selection (Part II), strategies for model evaluation, reporting and discussion of common caveats and other points of significance (Part III), as well as offer a practical guide to classification (Part IV) and regression modelling (Part V), with a complete coding pipeline. Methodological rigor and clarity as well as understanding of the underlying reasoning of the internal workings of a machine learning approach are required, otherwise predictive applications despite being strong analytical tools are not well accepted into the clinical routine. Going forward, machine learning and artificial intelligence shape and influence modern medicine across disciplines including the field of neurosurgery.
翻译:在新的大数据时代,现有更多的临床数据集和计算进步引起了对机器学习方法的极大兴趣。过去几年,与机器学习或人工智能有关的手稿数量急剧增加。随着分析机器学习工具随时可供临床医生使用,对关键概念的理解和对分析陷阱的认识日益需要,临床医生、调查员、审核员和编辑甚至作为临床领域的专家,有时发现自己不具备评价机器学习方法的足够条件。在第一节,我们解释了机器学习的一般原则,以及成功的机器学习预测模型所需的分析步骤——这是本系列工作的重点。在进一步各节,我们审查重新抽取、超配和模型通用的重要性,以及减少和选择特征(第二部分),示范评价、报告和讨论共同洞穴和其他重要要点(第三部分)的战略,以及为分类(第四部分)和回归模型(第五部分)提供实用指南,并附有完整的编织管道。方法严谨和清晰明晰,作为以其他方式学习、超前期分析工具,正在深入地学习、深入地进行机前期分析,同时不断学习、深入地了解机前期的机理学方法。