图像CHD:用于遗传心脏病分类的3D计算表层摄影图像数据集 (ImageCHD: A 3D Computed Tomography Image Dataset for Classification of Congenital Heart Disease)

Congenital heart disease (CHD) is the most common type of birth defect, which occurs 1 in every 110 births in the United States. CHD usually comes with severe variations in heart structure and great artery connections that can be classified into many types. Thus highly specialized domain knowledge and the time-consuming human process is needed to analyze the associated medical images. On the other hand, due to the complexity of CHD and the lack of dataset, little has been explored on the automatic diagnosis (classification) of CHDs. In this paper, we present ImageCHD, the first medical image dataset for CHD classification. ImageCHD contains 110 3D Computed Tomography (CT) images covering most types of CHD, which is of decent size Classification of CHDs requires the identification of large structural changes without any local tissue changes, with limited data. It is an example of a larger class of problems that are quite difficult for current machine-learning-based vision methods to solve. To demonstrate this, we further present a baseline framework for the automatic classification of CHD, based on a state-of-the-art CHD segmentation method. Experimental results show that the baseline framework can only achieve a classification accuracy of 82.0\% under a selective prediction scheme with 88.4\% coverage, leaving big room for further improvement. We hope that ImageCHD can stimulate further research and lead to innovative and generic solutions that would have an impact in multiple domains. Our dataset is released to the public compared with existing medical imaging datasets.

翻译：遗传性心脏病(CHD)是最常见的出生缺陷类型,在美国每110名新生儿中出现1例,在110名新生儿中出现1例。CHD通常在心脏结构上出现严重差异,而且大动脉连接可分为多种类型。因此,需要高度专业化的领域知识和耗时的人类过程来分析相关的医疗图像。另一方面,由于CHD的复杂性和缺乏数据集,很少探讨CHD的自动诊断(分类)问题。在本文中,我们提供了通用的图像CHD,这是CHD分类的第一个医学图像数据集。图像CHD包含110 3D复合成像学(CT)图像,覆盖了大多数类型的CHD,而CHD分类是相当规模的,因此需要通过高专业领域知识和耗时费的人类过程来分析相关的医学图像。另一方面,由于CHD的自动诊断(分类)的分类(分类)很少,因此,对于目前基于机器学习的视觉方法很难解决的问题。为了证明这一点,我们进一步展示了CHDD的自动分类的基线框架,以CHDD的状态为标准,只有CHDDD82-88比较的多重分析域,而使得我们的图像预测的模型的精确化数据能够实现一个基础化的升级的模型的升级化计划。我们根据一个基础,我们可以进一步的实验室的实验室的实验室的实验室的分类,我们可以实现一个基础化的分类。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日