Objectives: To assess the use of artificial intelligence-based software in ruling out chest X-ray cases, with no significant findings in a primary health care setting. Methods: In this retrospective study, a commercially available artificial intelligence (AI) software was used to analyse 10 000 chest X-rays of Finnish primary health care patients. In studies with a mismatch between an AI normal report and the original radiologist report, a consensus read by two board-certified radiologists was conducted to make the final diagnosis. Results: After the exclusion of cases not meeting the study criteria, 9579 cases were analysed by AI. Of these cases, 4451 were considered normal in the original radiologist report and 4644 after the consensus reading. The number of cases correctly found nonsignificant by AI was 1692 (17.7% of all studies and 36.4% of studies with no significant findings). After the consensus read, there were nine confirmed false-negative studies. These studies included four cases of slightly enlarged heart size, four cases of slightly increased pulmonary opacification and one case with a small unilateral pleural effusion. This gives the AI a sensitivity of 99.8% (95% CI= 99.65-99.92) and specificity of 36.4 % (95% CI= 35.05-37.84) for recognising significant pathology on a chest X-ray. Conclusions: AI was able to correctly rule out 36.4% of chest X-rays with no significant findings of primary health care patients, with a minimal number of false negatives that would lead to effectively no compromise on patient safety. No critical findings were missed by the software.
翻译:目标:评估人工智能软件在排除胸腔X光病例方面的使用情况,在初级保健环境中没有重大发现。方法:在这项回顾性研究中,利用商业上可用的人工智能软件分析芬兰初级保健病人的10 000个胸腔X光片。在一项AI正常报告与原放射学家报告不匹配的研究中,两名董事会认证的放射学家为最终诊断进行了协商一致。结果:在排除不符合研究标准的病例后,AI分析了9579个病例。在这些病例中,最初的放射学家报告认为4451例正常,在协商一致阅读后认为46444例。在AI正确发现的不重要病例中,有1692例(占所有研究的17.7%,占研究的36.4%)。在阅读共识后,有9项经证实的虚假否定性研究,其中包括4个心脏稍大病例、4个轻微不准确的肺部诊断病例和1个通过小规模单体液折损的病人病例。这给AI带来99.8%的敏感度(95-CI=65)基本病理学结论。