超声波上甲状腺结核多步自动数据标签程序:人工智能自动图像说明方法 (Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation)

Automator · 数据标签 · 模型评估 · 标注 · Learning ·

2022 年 6 月 28 日

Multistep Automated Data Labelling Procedure (MADLaP) for Thyroid Nodules on Ultrasound: An Artificial Intelligence Approach for Automating Image Annotation

翻译：超声波上甲状腺结核多步自动数据标签程序:人工智能自动图像说明方法

Jikai Zhang,Maciej M. Mazurowski,Brian C. Allen,Benjamin Wildman-Torbiner

Machine learning (ML) for diagnosis of thyroid nodules on ultrasound is an active area of research. However, ML tools require large, well-labelled datasets, the curation of which is time-consuming and labor-intensive. The purpose of our study was to develop and test a deep-learning-based tool to facilitate and automate the data annotation process for thyroid nodules; we named our tool Multistep Automated Data Labelling Procedure (MADLaP). MADLaP was designed to take multiple inputs included pathology reports, ultrasound images, and radiology reports. Using multiple step-wise modules including rule-based natural language processing, deep-learning-based imaging segmentation, and optical character recognition, MADLaP automatically identified images of a specific thyroid nodule and correctly assigned a pathology label. The model was developed using a training set of 378 patients across our health system and tested on a separate set of 93 patients. Ground truths for both sets were selected by an experienced radiologist. Performance metrics including yield (how many labeled images the model produced) and accuracy (percentage correct) were measured using the test set. MADLaP achieved a yield of 63% and an accuracy of 83%. The yield progressively increased as the input data moved through each module, while accuracy peaked part way through. Error analysis showed that inputs from certain examination sites had lower accuracy (40%) than the other sites (90%, 100%). MADLaP successfully created curated datasets of labeled ultrasound images of thyroid nodules. While accurate, the relatively suboptimal yield of MADLaP exposed some challenges when trying to automatically label radiology images from heterogeneous sources. The complex task of image curation and annotation could be automated, allowing for enrichment of larger datasets for use in machine learning development.

翻译：用于在超声波上诊断甲状腺结核的机器学习(ML)是一个活跃的研究领域。然而,ML工具需要大型、标签周密的数据集,这些数据集的校正需要花费时间和劳动密集型。我们研究的目的是开发和测试一个基于深层学习的工具,以促进和自动化甲状腺结核的数据批注过程;我们命名了我们的工具多步自动数据标签程序(MADLAP),设计MADLAP是为了接受多种投入,包括病理报告、超声波图像和放射报告。使用多步进式模块,包括基于规则的自然语言处理、基于深度学习的图像分解和光学字符识别。MADLAP自动识别一个基于深层学习的工具,以促进和测试一个基于深层学习工具的工具。该模型是使用由378名病人组成的培训集,由93名病人单独测试。两组的地面真相可以由有经验的放射学家选择。业绩指标,包括收益(许多标签显示模型的相对值)和精度(正确度),使用测试集的精度的精度,由测试组测的精度测的精度显示一个特定的精度,而MADLADLADLADLAD的精度的精度,通过每部的精度测试的精度则显示的精度,通过测试的精度,通过测试的精度的精度,将精度的精度分析的精度为83号的精度的精度。

相关内容

Automator

关注 0

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日