Many studies on machine learning (ML) for computer-aided diagnosis have so far been mostly restricted to high-quality research data. Clinical data warehouses, gathering routine examinations from hospitals, offer great promises for training and validation of ML models in a realistic setting. However, the use of such clinical data warehouses requires quality control (QC) tools. Visual QC by experts is time-consuming and does not scale to large datasets. In this paper, we propose a convolutional neural network (CNN) for the automatic QC of 3D T1-weighted brain MRI for a large heterogeneous clinical data warehouse. To that purpose, we used the data warehouse of the hospitals of the Greater Paris area (Assistance Publique-H\^opitaux de Paris [AP-HP]). Specifically, the objectives were: 1) to identify images which are not proper T1-weighted brain MRIs; 2) to identify acquisitions for which gadolinium was injected; 3) to rate the overall image quality. We used 5000 images for training and validation and a separate set of 500 images for testing. In order to train/validate the CNN, the data were annotated by two trained raters according to a visual QC protocol that we specifically designed for application in the setting of a data warehouse. For objectives 1 and 2, our approach achieved excellent accuracy (balanced accuracy and F1-score \textgreater 90\%), similar to the human raters. For objective 3, the performance was good but substantially lower than that of human raters. Nevertheless, the automatic approach accurately identified (balanced accuracy and F1-score \textgreater 80\%) low quality images, which would typically need to be excluded. Overall, our approach shall be useful for exploiting hospital data warehouses in medical image computing.
翻译:迄今为止,关于计算机辅助诊断的机器学习(ML)的许多研究大多限于高质量的研究数据。临床数据仓库,从医院收集例行检查,为培训和验证ML模型做出了重大承诺,然而,使用这类临床数据仓库需要质量控制工具。专家的视觉QC耗时且不至于大型数据集。在本文件中,我们提议为3D T1加权自动QC自动QC脑MRI系统,用于大型混合临床数据仓库。为此目的,我们利用了大巴黎地区医院(Pulbique-Hóopitaux de Paris[AP-HPH])的准确率数据库来进行培训和验证。具体来说,目标是:1)确定不适合T1加权大脑MMSI的图像;2)确定哪些采购是注射的;3)评定总体图像质量。我们用5000张图像来进行精准的培训和验证,但另有500张图像用于测试。为了培训/VALI的精准性能,我们为IMIS的精度目标2专门设计了一个直观度数据,通过直观的直观数据来确定一个直观数据。