Deep learning has been shown to accurately assess 'hidden' phenotypes and predict biomarkers from medical imaging beyond traditional clinician interpretation of medical imaging. Given the black box nature of artificial intelligence (AI) models, caution should be exercised in applying models to healthcare as prediction tasks might be short-cut by differences in demographics across disease and patient populations. Using large echocardiography datasets from two healthcare systems, we test whether it is possible to predict age, race, and sex from cardiac ultrasound images using deep learning algorithms and assess the impact of varying confounding variables. We trained video-based convolutional neural networks to predict age, sex, and race. We found that deep learning models were able to identify age and sex, while unable to reliably predict race. Without considering confounding differences between categories, the AI model predicted sex with an AUC of 0.85 (95% CI 0.84 - 0.86), age with a mean absolute error of 9.12 years (95% CI 9.00 - 9.25), and race with AUCs ranging from 0.63 - 0.71. When predicting race, we show that tuning the proportion of a confounding variable (sex) in the training data significantly impacts model AUC (ranging from 0.57 to 0.84), while in training a sex prediction model, tuning a confounder (race) did not substantially change AUC (0.81 - 0.83). This suggests a significant proportion of the model's performance on predicting race could come from confounding features being detected by AI. Further work remains to identify the particular imaging features that associate with demographic information and to better understand the risks of demographic identification in medical AI as it pertains to potentially perpetuating bias and disparities.
翻译:深层的学习显示可以准确评估“ 隐藏” 苯菌型, 并预测医学成像中的生物标志, 而不是传统临床对医学成像的解读。 鉴于人工智能(AI) 模型的黑盒性质, 在将模型应用于医疗时应该谨慎行事, 因为预测任务可能因疾病和病人人口分布差异的差别而短小。 使用两个医疗体系的大型回声心电图数据集, 我们测试是否有可能使用深度学习算法从心脏超声波图像中预测年龄、 种族和性别, 并评估各种混杂变异变量的影响。 我们训练了基于视频的神经神经神经网络以预测年龄、性别和种族特征。 我们发现深层次的学习模型能够识别年龄和性别特征,但又无法可靠地预测种族。 在不考虑类别差异的情况下, AI模型预测性别的性别变化与9.12年的绝对误差(95% CI-9.00- 9.25) 年龄, 与AUC 的种族运动从0.63到0.71 不等。 在预测种族运动的预测中,我们发现这种变异性数据比例从0.78 的模型到更精确的模型,我们从分析了对A 的性别的精确的精确的性别分析, 的精确分析, 的精确的精确的性别分析是分析, 从0.7 的精确的精确的精确的精确的精确的精确分析, 。