X-ray images may present non-trivial features with predictive information of patients that develop severe symptoms of COVID-19. If true, this hypothesis may have practical value in allocating resources to particular patients while using a relatively inexpensive imaging technique. The difficulty of testing such a hypothesis comes from the need for large sets of labelled data, which need to be well-annotated and should contemplate the post-imaging severity outcome. This paper presents an original methodology for extracting semantic features that correlate to severity from a data set with patient ICU admission labels through interpretable models. The methodology employs a neural network trained to recognise lung pathologies to extract the semantic features, which are then analysed with low-complexity models to limit overfitting while increasing interpretability. This analysis points out that only a few features explain most of the variance between patients that developed severe symptoms. When applied to an unrelated larger data set with pathology-related clinical notes, the method has shown to be capable of selecting images for the learned features, which could translate some information about their common locations in the lung. Besides attesting separability on patients that eventually develop severe symptoms, the proposed methods represent a statistical approach highlighting the importance of features related to ICU admission that may have been only qualitatively reported. While handling limited data sets, notable methodological aspects are adopted, such as presenting a state-of-the-art lung segmentation network and the use of low-complexity models to avoid overfitting. The code for methodology and experiments is also available.
翻译:X射线图象可能呈现出非三角性特征,其预测信息显示有严重COVID-19症状的患者的预测信息。如果确实如此,这一假设可能具有实际价值,在使用相对廉价的成像技术的同时,向特定患者分配资源。检验这种假设的困难在于需要大量贴标签的数据,这些数据需要说明性强,并应考虑成形后的严重程度结果。本文件介绍了一种原始方法,从一个数据组中提取与通过可解释模型获得的病人IPU入院标签的严重性相关联的语义特征。该方法使用经培训的神经网络来识别肺部病症,以提取语义特征,然后用低兼容性模型加以分析,以限制超编,同时增加可解释性。这一分析指出,只有少数几个特征可以解释出患者之间出现严重症状的大部分差异。当应用一个与病理学临床记录无关的较大数据集时,该方法显示能够为所学特征选择图像,可以翻译有关其肺部常见位置的一些信息。该方法除了测试病人的肺部病理性偏差性外,最后还用低的复度模型分析,因此只能表明具有严重CU性特征的生理分位特征,因此,拟议的统计方法显示,而采用明显分系的分位方法表明,但统计系系的分系的分解方法的分解方法可能只是测测测测测测测测测测。