Background & Aims: Hepatic steatosis is a major cause of chronic liver disease. 2D ultrasound is the most widely used non-invasive tool for screening and monitoring, but associated diagnoses are highly subjective. We developed a scalable deep learning (DL) algorithm for quantitative scoring of liver steatosis from 2D ultrasound images. Approach & Results: Using retrospectively collected multi-view ultrasound data from 3,310 patients, 19,513 studies, and 228,075 images, we trained a DL algorithm to diagnose steatosis stages (healthy, mild, moderate, or severe) from ultrasound diagnoses. Performance was validated on two multi-scanner unblinded and blinded (initially to DL developer) histology-proven cohorts (147 and 112 patients) with histopathology fatty cell percentage diagnoses, and a subset with FibroScan diagnoses. We also quantified reliability across scanners and viewpoints. Results were evaluated using Bland-Altman and receiver operating characteristic (ROC) analysis. The DL algorithm demonstrates repeatable measurements with a moderate number of images (3 for each viewpoint) and high agreement across 3 premium ultrasound scanners. High diagnostic performance was observed across all viewpoints: area under the curves of the ROC to classify >=mild, >=moderate, =severe steatosis grades were 0.85, 0.90, and 0.93, respectively. The DL algorithm outperformed or performed at least comparably to FibroScan with statistically significant improvements for all levels on the unblinded histology-proven cohort, and for =severe steatosis on the blinded histology-proven cohort. Conclusions: The DL algorithm provides a reliable quantitative steatosis assessment across view and scanners on two multi-scanner cohorts. Diagnostic performance was high with comparable or better performance than FibroScan.
翻译:背景与目标 : Hepatic sateatsismission 是慢性肝脏疾病的一个主要原因。 2D 超声波是用于筛选和监测的最广泛使用的非侵入性工具,但相关的诊断是高度主观的。 我们开发了一个可扩缩的深度学习(DL)算法,用于从 2D 超声波图像中量化肝胃病。 方法与结果: 使用从3 310 名病人、 19 513 研究 和 228 075 0.75 图像 收集的多视图超声波数据, 我们训练了DL 算法, 用于诊断超声波诊断诊断性阶段(健康、轻度、中度或严重) 的DL 算法。 两个多扫描器的性能被验证了(不盲和盲) (最初为 DL 开发者) 直位组群(147 和 112 病人) 及其直位细胞细胞诊断性能诊断, 以及一个子集的诊断性能。 我们在扫描和观点上量化的可靠性。 利用Bland-Alterman-A和接收器运行的性性特征 (ROC0) 分析期间的 3 快速分析中, DL dalaldals 显示的性能的性能的性能测试。