Machine learning is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but their lack of interoperability has been a major barrier for clinical integration and evaluation. The DICOM a standard specifies Information Object Definitions and Services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard represents an obstacle for its adoption in the ML community and creates a need for software libraries and tools that simplify working with data sets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging, that, by bridging these two ecosystems, highdicom enables developers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages. To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source.
翻译:机器学习正在使病理学和放射学方面的基于图像的诊断发生革命性的变化。ML模型在研究环境中显示出令人乐观的结果,但缺乏互操作性是临床整合和评价的一个主要障碍。DICOM标准为数字图像和相关信息的表达和通信提供了信息对象定义和服务,包括图像衍生说明和分析结果。然而,该标准的复杂性阻碍了在ML社区采用该标准,并导致需要软件库和工具简化与DICOM格式数据集的工作。这里我们介绍高二COM图书馆,该图书馆为Python编程语言提供了一个高层次的临床编程界面,其中摘述了标准的低层次细节,便于在Python代码的几行中对DICOM格式的图像信息进行编码和代码化与通信。高二图书馆与广泛的PythonM生态系统用于图像处理和机器学习的宽广的Mython生态系统联系。同时,简化了DICOM合规文件的创建和分类,高二科之间实现了与医疗成像系统的互操作性,该系统将数据用于培训和运行Mython编程的低级编程程序,最终通过ML模型和存储和存储这些模型,我们用来进行磁化和升级的模型,并演示了这些系统,我们通过这些模型和升级的模型,从而演示和升级了这些系统,从而演示了这些系统,从而演示了这些系统,从而演示了ML的模型和存储和存储和存储和存储和存储和储存。