Machine learning methods exploiting multi-parametric biomarkers, especially based on neuroimaging, have huge potential to improve early diagnosis of dementia and to predict which individuals are at-risk of developing dementia. To benchmark algorithms in the field of machine learning and neuroimaging in dementia and assess their potential for use in clinical practice and clinical trials, seven grand challenges have been organized in the last decade: MIRIAD, Alzheimer's Disease Big Data DREAM, CADDementia, Machine Learning Challenge, MCI Neuroimaging, TADPOLE, and the Predictive Analytics Competition. Based on two challenge evaluation frameworks, we analyzed how these grand challenges are complementing each other regarding research questions, datasets, validation approaches, results and impact. The seven grand challenges addressed questions related to screening, diagnosis, prediction and monitoring in (pre-clinical) dementia. There was little overlap in clinical questions, tasks and performance metrics. Whereas this has the advantage of providing insight on a broad range of questions, it also limits the validation of results across challenges. In general, winning algorithms performed rigorous data pre-processing and combined a wide range of input features. Despite high state-of-the-art performances, most of the methods evaluated by the challenges are not clinically used. To increase impact, future challenges could pay more attention to statistical analysis of which factors (i.e., features, models) relate to higher performance, to clinical questions beyond Alzheimer's disease, and to using testing data beyond the Alzheimer's Disease Neuroimaging Initiative. Given the potential and lessons learned in the past ten years, we are excited by the prospects of grand challenges in machine learning and neuroimaging for the next ten years and beyond.
翻译:利用多参数生物标志的机器学习方法,特别是基于神经成像学的机器学习方法,在改进痴呆症早期诊断和预测哪些人有发展痴呆症的风险方面具有巨大的潜力。为了在痴呆症的机器学习和神经成形学领域衡量算法,并评估其在临床实践和临床试验中使用这些算法的潜力,在过去十年里组织了七大挑战:MIRIAD、阿尔茨海默氏病大数据DREAM、CADDENIGE、机器学习挑战、MCI Neuroimaging、TADPOLE和预测分析竞争。根据两个挑战评价框架,我们分析了这些巨大的挑战是如何在研究问题、数据集、验证方法、结果和影响方面相互补充的。这七大挑战涉及与(临床前)痴呆症的筛选、诊断、预测和监测有关的问题。临床问题、任务和性能衡量方面几乎没有重叠。这有利于对广泛的问题进行深入的洞察,但也限制了对挑战的验证。在一般情况下,通过严格的算算法进行最严格的前期数据分析,过去的数据分析,而未来数据分析则结合了过去的数据分析。