This report addresses the technical aspects of de-identification of medical images of human subjects and biospecimens, such that re-identification risk of ethical, moral, and legal concern is sufficiently reduced to allow unrestricted public sharing for any purpose, regardless of the jurisdiction of the source and distribution sites. All medical images, regardless of the mode of acquisition, are considered, though the primary emphasis is on those with accompanying data elements, especially those encoded in formats in which the data elements are embedded, particularly Digital Imaging and Communications in Medicine (DICOM). These images include image-like objects such as Segmentations, Parametric Maps, and Radiotherapy (RT) Dose objects. The scope also includes related non-image objects, such as RT Structure Sets, Plans and Dose Volume Histograms, Structured Reports, and Presentation States. Only de-identification of publicly released data is considered, and alternative approaches to privacy preservation, such as federated learning for artificial intelligence (AI) model development, are out of scope, as are issues of privacy leakage from AI model sharing. Only technical issues of public sharing are addressed.
翻译:该报告涉及医学图像去识别的技术方面,将人类受试者和生物标本的图像加以去识别,以使伦理、道德和法律风险足够降低,以便无论来源和分发站点的司法管辖区如何,都可以无限制地公开共享。所有医学图像,无论采集方式如何,均被视为同等重要,尤其是那些带有相关数据元素的图像,特别是数字成像与通信医学 (DICOM) 中嵌入数据元素的格式。这些图像包括像图像一样的对象,如分段、参数映射和放射治疗 (RT) 剂量对象。范围也包括相关的非图像对象,如 RT 结构集、计划和剂量体积直方图、结构化报告和显示状态。只考虑发布后的数据去识别,联合学习等其他隐私保护方法,例如为人工智能 (AI) 模型开发而共享等问题不在范围内,AI 模型共享隐私泄露的问题也不在范围内。只解决公开共享的技术问题。